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On metric Ramsey-type phenomena 
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Abstract 



^Sj . The main question studied in this article may be viewed as a nonlinear 

analogue of Dvoretzky's theorem in Banach space theory or as part of Ramsey 

r*n I theory in combinatorics. Given a finite metric space on n points, we seek its 

subspace of largest cardinality which can be embedded with a given distortion 
in Hilbert space. We provide nearly tight upper and lower bounds on the 
cardinality of this subspace in terms of n and the desired distortion. Our main 
2 ' theorem states that for any e > 0, every n point metric space contains a subset 

of size at least n^~'' which is embeddable in Hilbert space with O i °^^^ '^' 1 
distortion. The bound on the distortion is tight up to the log(l/e) factor. We 
^ ■ further include a comprehensive study of various other aspects of this problem. 
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1. Introduction 

The philosophy of modern Ramsey theory states that large systems neces- 
sarily contain large, highly structured sub-systems. The classical Ramsey col- 
oring theorem [49], [29] is a prime example of this principle: Here "large" refers 
to the cardinality of a set, and "highly structured" means being monochro- 
matic. 

Another classical theorem, which can be viewed as a Ramsey-type phe- 
nomenon, is Dvoretzky's theorem on almost spherical sections of convex bod- 
ies. This theorem, a cornerstone of modern Banach space theory and convex 
geometry, states that for all e > 0, every n-dimensional normed space X con- 
tains a fc-dimensional subspace Y with d(y, £2) ^ 1 + £; where k > c(e) logn. 
Here d{-, •) is the Banach-Mazur distance, which is defined for two isomorphic 
normed spaces Zi,Z2 as: 

(i(Zi,Z2) =inf{||r|| • ||T~i||;TG GL(Zi,Z2)}. 

Dvoretzky's theorem is indeed a Ramsey-type theorem, in which "large" is 
interpreted as high-dimensional, and "highly structured" means close to Eu- 
clidean space in the Banach-Mazur distance. 

Dvoretzky's theorem was proved in [24], and the estimate k > c(e)logn, 
which is optimal as a function of n, is due to Milman [44]. The dimension 
of almost spherical sections of convex bodies has been studied in depth by 
Figiel, Lindenstrauss and Milman in [27], where it was shown that under some 
additional geometric assumptions, the logarithmic lower bound for dim(y) in 
Dvoretzky's theorem can be improved significantly. We refer to the books 
[46], [48] for good expositions of Dvoretzky's theorem, and to [47], [45] for an 
"isomorphic" version of Dvoretzky's theorem. 

The purpose of this paper is to study nonlinear versions of Dvoretzky's 
theorem, or viewed from the combinatorial perspective, metric Ramsey-type 
problems. In spite of the similarity of these problems, the results in the metric 
setting differ markedly from those for the linear setting. 

Finite metric spaces and their embeddings in other metric spaces have 
been intensively investigated in recent years. See for example the surveys [30], 
[36], and the book [42] for an exposition of some of the results. 

Let f : X ^ Y he an embedding of the metric spaces {X, dx) into (Y, dy)- 
We define the distortion of / by 

,- .(f. dY{f{x),f{y)) dx{x,y) 

dist(/) = sup -— 7 • sup 



x,yGX dxix,y) x,yexdY{f{x),f{y))' 

Xy^y Xy^y 

We denote by cy{X) the least distortion with which X may be embedded in Y. 
When cy{X) < a we say that X a-embeds into Y and denote X ^^ Y. When 
there is a bijection / between two metric spaces X and Y with dist(/) < q we 
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say that X and Y are a-equivalent. For a class of metric spaces A4, cm{^) 
is the mininiuni a such that X a-embeds into some metric space in ^A. For 
p > 1 we denote q {X) by Cp{X). The parameter C2{X) is known as the 
Euclidean distortion of X. A fundamental result of Bourgain [15] states that 
C2{X) = O(logn) for every n-point metric space {X,d). 

A metric Ramsey-type theorem states that a given metric space contains 
a large subspace which can be embedded with small distortion in some "well- 
structured" family of metric spaces (e.g., Euclidean). This can be formulated 
using the following notion: 

Definition 1.1 (Metric Ramsey functions). Let ^A be some class of met- 
ric spaces. For a metric space X, and a > 1, i?;K(X;a) denotes the largest 
size of a subspace Y oi X such that cm {Y) < a. 

Denote by R_M{a,n) the largest integer m such that any n-point metric 
space has a subspace of size m that a-embeds into a member of A^. In other 
words, it is the infimum over X, \X\ = n, of Rjii{X; a). 

It is also useful to have the following conventions: For a = 1 we allow 
omitting a from the notation. When Ai = {X}, we write X instead of A4. 
Moreover when Ai = {ip}, we use Rp rather than R^^. 

In the most general form, let M he a class of metric spaces and denote by 
RM{J^',o:,n) the largest integer m such that any n-point metric space in Af 
has a subspace of size m that a-embeds into a member of M. In other words, 
it is the infimum over X £ M, \X\ = n, of Rm{X; a). 

1.1. Results for arbitrary metric spaces. This paper provides several re- 
sults concerning metric Ramsey functions. One of our main objectives is to 
provide bounds on the Euclidean Ramsey Function, R2{a,n). 

The first result on this problem, well-known as a nonlinear version of 
Dvoretzky's theorem, is due to Bourgain, Figiel and Milman [17]: 

Theorem 1.2 ([17]). For any a > 1 there exists C{a) > such that 
R2{oi,n) > C(a)logn. Furthermore, there exists oq > 1 such that R2{aQ,n) = 
O(logn). 

While Theorem 1.2 provides a tight characterization of R2{a, n) = 0(log n) 
for values of a < ao (close to 1), this bound turns out to be very far from the 
truth for larger values of a (in fact, a careful analysis of the arguments in [17] 
gives ao ~ 1.023, but as we later discuss, this is not the right threshold). 

Motivated by problems in the field of Computer Science, more researchers 
[32], [14], [5] have investigated metric Ramsey problems. A close look (see 
[5]) at the results of [32], [14] as well as [17] reveals that all of these can be 
viewed as based on Ramsey-type theorems where the target class is the class 
of ultrametrics (see §3.1 for the definition). 
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The usefulness of such results for embeddings in (.2 stems from the well- 
known fact [34] that ultrametrics are isometrically embeddable in H.2- Thus, 
denoting the class of ultrametrics by UM, we have that R2{a, n) > RuMid, n). 

The recent result of Bartal, Bollobas and Mendel [5] shows that for large 
distortions the metric Ramsey function behaves quite differently from the be- 
havior expressed by Theorem 1.2. Specifically, they prove that R2{a,n) > 
R\JM{o:,n) > exp ((logn)"*^" '^'°)) (in fact, it was already implicit in [14] that 
a similar bound holds for a particular a). The main theorem in this paper is: 

Theorem 1.3 (Metric Ramsey-type theorem). For every e > 0, any 
n-point metric space has a subset of size n^^^ which embeds in Hilbert space 
with distortion O I °^^^ '^' J . Stated in terms of the metric Ramsey function, 
there exists an absolute constant C > such that for every a > 1 and every 
integer n: 

R2{a,n) > RuM{o:,n) > n ° . 

We remark that the lower bound above for i?uM(a,n) is meaningful only 
for large enough a. Small distortions are dealt with in Theorem 1.6 (see also 
Theorem 3.26). 

The fact that the subspaces obtained in this Ramsey-type theorem are 
ultrametrics in not just an artifact of our proof. More substantially, it is 
a reflection of new embedding techniques that we introduce. Indeed, most 
of the previous results on embedding into ip have used what may be called 
Frechet-type embeddings: forming coordinates by taking the distance from a 
fixed subset of the points. This is the way an arbitrary finite metric space is 
embedded in £00 (attributed to Frechet). Bourgain's embedding [15] and its 
generalizations [41] also fall in this category of embeddings. However, it is 
possible to show that Frechet-type embeddings are not useful in the context 
of metric Ramsey-type problems. More specifically, we show in [6] that such 
embeddings cannot achieve bounds similar to those of Theorem 1.3. 

Ultrametrics have a useful representation as hierarchically well-separated 
trees (HST's). A fc-HST is an ulrametric where vertices in the rooted tree are 
labelled by real numbers. The labels decrease by a factor > k as you go down 
the levels away from the root. The distance between two leaves is the label of 
their lowest common ancestor. These decomposable metrics were introduced by 
Bartal [3]. Subsequently, it was shown (see [3], [4], [28]) that any n-point metric 
can be 0(logn)-probabilistically embedded^ in ultrametrics. This theorem has 
found many unexpected algorithmic applications in recent years, mostly in 



'^A metric space can be a-probabilistically embedded in a class of metric spaces if it is 
Qf-equivalent to a convex combination of metric spaces in the class, via a noncontractive 
Lipschitz embedding [4]. 
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providing computationally efficient approximate solutions for several NP-hard 
problems (see the survey [30] for more details). 

The basic idea in the proof of Theorem 1.3 is to iteratively find large sub- 
spaces that are hierarchically structured, gradually improving the distortion 
between these subspaces and a hierarchically well-separated tree. These hierar- 
chical structures are naturally modelled via a notion (which is a generalization 
of the notion of /c-HST) we call metric composition closure. Given a class of 
metric spaces Ai, we obtain a metric space in the class comp;i.(A^) by taking 
a metric space M G M and replacing its points with copies of metric spaces 
from comp j,{^A) dilated so that there is a factor k gap between distances in 
M and distances within these copies. 

Metric compositions are also used to obtain the following bounds on the 
metric Ramsey function in its more general form: 

Theorem 1.4 (Generic bounds on the metric Ramsey function). Let C 
be a proper class of finite metric spaces that is closed under: (i) Isometry, 
(ii) Passing to a subspace, (iii) Dilation. Then there exists 6 < 1 such that 
Rc{n) < n for infinitely many values of n. 

In particular we can apply Theorem 1.4 to the class C = {X; c;k(X) < a} 
where Ai is some class of metric spaces. If there exists a metric space Y with 
cjii{Y) > a, then there exists 6 < 1 such that R_\4{a,n) < n for infinitely 
many n's. 

In the case of £2 or ultrametrics much better bounds are possible, showing 
that the bound in Theorem 1.3 is almost tight. For ultrametrics this is a rather 
simple fact [5]. For embedding into £2 this follows from bounds for expander 
graphs, described later in more detail. 

Theorem 1.5 (near tightness). There exist absolute constants c,C > 
such that for every a > 2 and every integer n: 

Rvuictin) < R2{a,n) < Cn ~~. 

The behavior of i?uM(a, n) and R2{a, n) exhibited by the bounds in The- 
orems 1.2 and 1.3 is very different. Somewhat surprisingly, we discover the 
following phase transition: 

Theorem 1.6 (phase transition). For every a > 1 there exist constants 
c,C,c' ,C' ,K > depending only on a such that < c' < C" < 1 and for every 
integer n: 

a) If 1 < a < 2 then clogn < RuM{a,n) < R2{a,n) < 21og2 n + C. 

b) If a>2 then n"' < i?uM(a,n) < R2{a,n) < K n^' . 
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Using bounds on the dimension with which any n point ultranietric is 
embeddable with constant distortion in ip [7] we obtain the following corollary: 

Corollary 1.7 (Ramsey-type theorems with low dimension). There ex- 
ists < C{a) < 1 such that for every p > 1, a > 2, and every integer n, 

where C{a) > 1 — '^ '^" , d = ["[ faf 2)^ "l^('^) log^ ' '^^^ c, c' > are universal 
constants. 

This result is meaningful since, although £2 isometrically embeds 
into Lp for every 1 < p < 00, there is no known ip analogue of the Johnson- 
Lindenstrauss dimension reduction lemma [31] (in fact, the Johnson- 
Lindenstrauss lemma is known to fail in ii [19], [33]). These bounds are almost 
best possible. 

Theorem 1.8 (The Ramsey problem for finite dimensional normed spaces). 
There exist absolute constants C,c > such that for any a > 2, every integer 
n and every finite dimensional normed space X, 

Rx{a,n) < Cn ~~(dimX) loga. 

For completeness, we comment that a natural question, in our context, is 
to bound the size of the largest subspace of an arbitrary finite metric space 
that is isometrically embedded in ip. In [8] we show that Rp{n) = 3 for every 
1 < p < 00 and n > 3. 

Finally, we note that one important motivation for this work is the appli- 
cability of metric embeddings to the theory of algorithms. In many practical 
situations, one encounters a large body of data, the successful analysis of which 
depends on the way it is represented. If, for example, the data have a natural 
metric structure (such as in the case of distances in graphs), a low distortion 
embedding into some normed space helps us draw on geometric intuition in 
order to analyze it efficiently. We refer to the papers [4] , [26] , [37] and the sur- 
veys [30], [36] for some of the applications of metric embeddings in Computer 
Science. More about the relevance of Theorem 1.3 to Computer Science can 
be found in [9] (see also [5], [10]). 

1.2. Results for special classes of metric spaces. We provide nearly tight 
bounds for concrete families of metric spaces: expander graphs, the discrete 
cube, and high girth graphs. In all cases the difficulty is in providing upper 
bounds on the Euclidean Ramsey function. 

Let G = iy, E) be a d-regular graph, d > 3, with absolute multiplica- 
tive spectral gap 7 (i.e. the second largest eigenvalue, in absolute value, of 
the adjacency matrix of G is less than 7^). For such expander graphs it is 
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known [37], [41] that C2{G) = ^^^dO-Og\V\) (here, and in what follows, the 
notation o„ = 0,(bn) means that there exists a constant c > such that for 
all n, |a„| > c|5„|. When c is allowed to depend on, say, 7 and d we use the 
notation 0,-y^(i)- In Section 5 we prove the following: 

Theorem 1.9 (The metric Ramsey problem for expanders). Let G = 
(y, E) be a d-regular graph, d > 3 with absolute multiplicative spectral gap 
7 < 1. Then for every p G [1, cxd), and every a > 1, 

\V\ °'°«.(i/-) < i?2(G;a) < Rp{G;a) < Cd\V\^ ^ , 
where C,c> are absolute constants. 

The proof of the upper bound in Theorem 1.9 involves proving certain 
Poincare inequalities for power graphs of G. 

Let Qd = {0, l}'^ be the discrete cube equipped with the Hamming metric. 
It was proved by Enflo, [25], that C2{^d) = yd- Both Enflo's argument, and 
subsequent work of Bourgain, Milman and Wolfson [18], rely on nonlinear 
notions of type. These proofs strongly use the structure of the whole cube, 
and therefore seem not applicable for subsets of the cube. In Section 6.2 we 
prove the following strengthening of Enflo's bound: 

Theorem 1.10 (The metric Ramsey problem for the discrete cube). 
There exist absolute constants C,c such that for every a > 1: 

^dii-i2Mo^) < ^2(f^d;a) < C2<i-^). 



The lower bounds on the Euclidean Ramsey function mentioned above are 
based on the existence of large subsets of the graphs which are within distortion 
a from forming an equilateral space. In particular for the discrete cube this 
corresponds to a code of large relative distance. Essentially, our upper bounds 
on the Euclidean Ramsey function show that for a fixed size, no other subset 
achieves significantly better distortions. 

In [38] it was proved that if G = {V, E) is a d-regular graph, d > 2>, 
with girth g, then C2(G) > c-^^. In Section 6.1 we prove the following 
strengthening of this result: 

Theorem 1.11 (The metric Ramsey problem for large girth graphs). 
Let G = {y,E) be a d-regular graph, d > 3, with girth g. Then for every 
l<a<^, 

R2{G-a)<C{d-l)-^\V\, 

where C, c > are absolute constants. 
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The proofs of Theorem 1.10 and Theorem 1.11 use the notion of Markov 
type, due to K. Ball [2]. In addition, we need to understand the algebraic 
properties of the graphs involved (Krawtchouk polynomials for the discrete 
cube and Geronimus polynomials in the case of graphs with large girth). 

2. Metric composition 

In this section we introduce the notion of metric composition, which plays 
a basic role in proving both upper and lower bounds on the metric Ramsey 
problem. Here we introduce this construction and use it to derive some non- 
trivial upper bounds. The bounds achievable by this method are generally 
not tight. For the Ramsey problem on Ip, better upper bounds are given in 
Sections 4 and 5. In Section 3 we use metric composition in the derivation of 
lower bounds. 

2.1. The basic definitions. 

Definition 2.1 (Metric composition). Let M be a finite metric space. Sup- 
pose that there is a collection of disjoint finite metric spaces N^ associated with 
the elements x of M. Let J\f = {Nx}xeM- For (3 > 1/2, the /3-composition 
of M and N , denoted by C = Afa[AA], is a metric space on the disjoint union 
{JxNx- Distances in C are defined as follows. Let x,y £ M and u £ Nx,v £ Ny] 
then: 

, . X idNAu,v) x = y 
dc[u,v) = < 

yl3-idM{x,y) x/y, 

where 7 = '".'"^^" ^^) '{ . It is easily checked that the choice of the factor 
/37 guarantees that dc is indeed a metric. If all the spaces Nx over x £ M 
are isometric copies of the same space A^, we use the simplified notation C = 
Mp[N]. 

Informally stated, a metric composition is created by first multiplying the 
distances in M by P'j, and then replacing each point x of M by an isometric 
copy of Nx. 

A related notion is the following: 

Definition 2.2 (Composition closure). Given a class Ai of finite metric 
spaces, we consider comp/3(A4), its closure under > /3-compositions. Namely, 
this is the smallest class C of metric spaces that contains all spaces in A4 , and 
satisfies the following condition: Let M £ M, and associate with every x £ M 
a metric space Nx that is isometric to a space in C. Also, let /?' > /3. Then 
M^/[A/] is also in C. 
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2.2. Generic upper bounds via metric composition. We need one more 
definition: 

Definition 2.3. A class C of finite metric spaces is called a metric class if 
it is closed under isometries. C is said to be hereditary, if M £ C and N <Z M 
imply N £ C. The class is said to be dilation invariant if (M,d) G C implies 
that (M, Xd) £ C for every A > 0. 

Let M^ = {X;cjii{X) < a} denote the class of all metric spaces that 
a-embed into some metric space in Ai. Clearly, A^*^ is a hereditary, dilation- 
invariant metric class. 

We recall that Rc{X) is the largest cardinality of a subspace of X that is 
isometric to some metric space in the class C. 

Proposition 2.4. Let C he a hereditary, dilation invariant metric class 
of finite metric spaces. Then, for every finite metric space M and a class 
J\f = {N^]x(zM, and every [3 > 1/2, 

Rc{Mp[N]) < Re{M) ■ maxRdN,). 

xeM 

In particular, for every finite metric space N, 

RciMf,[N]) < Rc{M)Rc{N). 

Proof. Let m = Rc{M) and k = maXx^M Rci^^x)- Fix any X C U^jA'^j; 
with \X\ > mk. For every z £ M let X^ = X nN^. Set Z = {z G M; A^ / 0}. 
Note that \X\ = Xlzez \^z\ so that if \Z\ < m then there is some y £ M with 
\Xy\ > k. In this case, the set Xy consists of more than k elements in X, the 
metric on which is isometric to a subspace of Ny, and therefore is not in C. 
Since C is hereditary this implies that X ^ C. Otherwise, \Z\ > m. Fix for 
each z £ Z some arbitrary point Uz £ Xz and set Z' = {uz', z £ Z}. Now, Z' 
consists of more than m elements in X, the metric on which is a /37-dilation of 
a subspace of M, hence not in C. Again, the fact that C is hereditary implies 
that X ^C. D 

In what follows let Rc{A,n) = Rc{A;l,n). Recall that Rc{A;l,n) > t 
if and only if for every X £ A with \X\ = n, there is a subspace of X with t 
elements that is isometric to some metric space in the class C. 

Lemma 2.5. LetC be a hereditary, dilation invariant metric class of finite 
metric spaces. Let A he a class of metric spaces, and let 6 £ (0, 1). // there 
exists an integer m > 1 such that Rc{A, m) < m , then for any [3 > 1/2, and 
infinitely many integers n: 

Rc{cova.Y>p{A),n) < n . 
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Proof. Fix some /? > 1/2. Let ^ G ^ be such that \A\ = m > 1 and 
-Rc(^) < ""T-"^- Define inductively a sequence of metric spaces in comp^(^) 
by: Ai = A and ^j+i = ^/3[^i]. Proposition 2.4 implies that i?c(^i+i) < 
i?c(^j)-Rc(^) < Rc{A)m^- It follows that i?c(^j) < m'^ = \Ai\^ . U 

Lemma 2.6. Let C he a nonempty hereditary, dilation invariant metric 
class of finite metric spaces. Let A be a class of finite metric spaces, such that 
Rc{A,m) < m for some integer m {i.e., there is some space A ^ A with no 
isometric copy in C). Then there exists 5 G (0, 1), such that for any (3 > 1/2, 
and infinitely many integers n: 

Rc{cova.Y>p{A),n) < n . 

Proof. Let m be the least cardinality of a space ^ G ^ of with no isometric 
copy in C. Since C is nonempty and hereditary, m> 2. Define 6 hy m — 1 = m . 
Now apply Lemma 2.5. D 

Lemma 2.6 can be applied to obtain nontrivial bounds on various metric 
Ramsey functions. 

Corollary 2.7. Let C he a hereditary, dilation invariant metric class 
which contains some, hut not all finite metric spaces. Then there exists a 
6 G (0, 1), such that Rc{n) < n for infinitely many integers n. 

Proof. We use Lemma 2.6 with A = comp^(^) = the class of all metric 
spaces. D 

Let A^ be a fixed class of metric spaces and a > 1. The following corollary 
follows when we apply Corollary 2.7 with C = ^A^ as defined above. 

Corollary 2.8. Let M he a metric class of finite metric spaces and 
a > 1. The following assertions are equivalent: 

a) There exists an integer n, such that Rj^{a,n) < n. 

b) There exists 5 G (0,1), such that RM{ct,n) < n^ for infinitely many 
integers n. 

For our next result, recall that a normed space X is said to have cotype 
q if there is a positive constant C such that for every finite sequence xi, . . . 

. . . , Xfn G v\ , 



E 



m 

SiXi 






>C{Y,\\^^¥ 



^i=l 
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where £i,...,em are i.i.d. ±1 Bernoulli random variables. It is well known 
(see [46]) that for 2 < g < c>o, ^^ has cotype q (and it does not have cotype q' 
for any q' < q). 

Corollary 2.9. Let X be a normed space. Then the following assertions 
are equivalent: 

a) X has finite cotype. 

b) For any a > 1, there exists S G (0, 1) such that for infinitely many 
integers n, Rx{a,n) < n^ . 

c) There exists a > 1 and an integer n such that Rx{a,n) < n. 

Proof. To prove the implication a) =^ b), fix a > 1. Now, since X 
has finite cotype, there is an integer h such that d{i[!^,Z) > a for every 
/i-dimensional subspace Z of X, where d{-,-) is the Banach-Mazur distance. 
This implies that for some e > 0, an e-net £ in the unit ball of i^ does not 
a-embed into X. This follows from a standard argument in nonlinear Banach 
space theory. Indeed, a compactness argument would imply that otherwise 
BJ^, the unit ball of £^, a-embeds into X. By Rademacher's theorem (see for 
example [12]) such an embedding must be differentiable in an interior point 
of BJ^. The derivative, T, is a linear mapping which is easily seen to satisfy 
||T|| • \\T'^\\ < a, so that d{i.^^,Z) < a for the subspace Z = T{l'^). Apply 
Corollary 2.8 with M = X, and n = \£\ to conclude that b) holds. 

The implication b) =^ c) is obvious, so we turn to prove that c) =^ a). 
Assume that X does not have finite co-type, and fix some < e < a — 1. By 
the Maurey-Pisier theorem (see [43] or Theorem 14.1 in [23]), it follows that 
for every n, P^ (q — e)-embeds into X. Since P^ contains an isometric copy of 
every n-point metric space, we deduce that for each n, Rx{ct, n) = n, contrary 
to our assumption c). D 

We now need the following variation on the theme of metric composition. 

Definition 2.10. A family of metric spaces M is called nearly closed under 
composition, if for every A > 1, there exists some (3 > 1/2 such that cj\f{X) < A 
for every X £ compo(AA). In other words, 

A 

comp^(AA) C M^. 

We have the following variant of Corollary 2.8: 

Lemma 2.11. Let M be a metric class of finite metric spaces and let J\f 
be some class of finite metric spaces which is nearly closed under composi- 
tion. Assume that there is some space in M which does not a-embed into any 
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space in Ai. Then there exists 5 £ (0,1), such that for every 1 < a' < a, 
Rm{-^j'^\^) ^ "^ /o?" infinitely many integers n. 

Proof. Fix some a' < a and let A = a/a' . As J\f is nearly closed under 

A 

composition there exists /3 > 1/2 such that compo(AA) C J\f^ . This means 
that for every Z G comp^(AA) there exists some N £ M that is A-equivalent 
to Z. 

For every integer p let k{p) = Rm{J^', oi\p). If \Z\ = \N\ = n, then there 
is X C iV such that cx(X) < a' and \X\ > k{n). Let Y (1 Z be the set 
corresponding to X under the A-equivalence between Z and N. Then, |y| = 
\X\ > k{n) and by composition of maps, cjniY) ^ Act' = a. That is, every 
n-set Z in comp^(AA) contains a k{n) subset Y that a-embeds into a space 

inA^; i.e. Y £ Ai^ . In our notation, this means that k{n) < Rq {couip a{M) , n) , 

where C = M^ . 

The assumption made in the lemma about M means that Rc{M, m) < m 
for some integer m. By Lemma 2.6 there exists 5 G (0, 1) such that for infinitely 
many integers n, 

RmU^'iO^ 1^) = k{n) < i?c(comp^(AA),n) < n , 

as claimed. D 

Next, we give a several results that demonstrate the applicability of 
Lemma 2.11. 

Proposition 2.12. Let (X, || • ||) he a normed space. The class M of 
finite subsets of X is nearly closed under composition. 

Proof. Fix some A > 1. Let Z £ comp^(A^) for some /3 > 1/2 to be 
determined later. We prove that Z can be A-embedded in X. The proof is by 
induction on the number of steps taken in composing Z from spaces in A4. If 
Z £ M there is nothing to prove. Otherwise, it is possible to express Z as 
Z = M^[AA], where M £ Ai and J\f = {Nz}zeM such that each of the spaces N^ 
is in comp^(A^) and can be created by a shorter sequence of composition steps. 
By induction we assume that there exists /3 for which N^ can be A-embedded 
in X. Fix for every z £ M , (p^ : Nz -^ X satisfying: 

Vn,f £ Nz, dN,iu,v) < \\(l)z{u) - 4>z{v)\\ < XdN^iu,v), 

and for all n £ Nz, \\(/)ziu)\\ < Adiam(A^2) (this can be assumed by an appro- 
priate translation). 

Define (p : Z ^ X as follows: for every u £ Z , let z £ M he such that 
u £ Nz, then 6(u) = B^ ■ z + (j)z(u), where 7 = "^'^^^ "^"?i ^ i, . 
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We now bound the distortion of 0. Assume f3 > 2A. Consider first u,v G 
Nz for some z G M. 

dz{u,v) = (InSu^v) < \\4)z{u) - (l)z{v)\\ < XdN^u^v) = \dz{u,v). 

Now, let u G Nx,v G Ny, for x / y G M, 

U{u) - 0WII < h\\x - y\\ + \\<P,{u) - (t>y{v)\\ 

< (5-f\\x - y\\ + A(diam(iV^) + diam(iVy)) 

<^{P + 2X)\\x-y\\ = ^^dz{u,v). 

Similarly, 

U{u) - 0(^)11 > I3^\\x - y\\ - \\<P^{u) - (t>y{v)\\ 

> Pj\\x - y\\ - A(diam(iV^) + diam(iVy)) 

B — IX 

<j{P-2X)\\x-y\\ = '^^dz{u,v). 

Hence if /3 > 2A^^, we have, 

,. /,N f ^ /3 + 2A1 

dist((pj < max < A, — — — f = A. D 

Recall that a normed space X is said to be A finitely representable in a 
normed space Y if for any finite dimensional linear subspace Z C X and every 
r] > there is a subspace H^ of y such that d{Z, W) < A + r/. 

Corollary 2.13. Let X and Y be normed spaces and a > 1. The fol- 
lowing are equivalent: 

1) X is not a-finitely representable in Y . 

2) There are 77 > and S G (0,1) such that i?y(X;Q + ri,n) < n for 
infinitely many integers n. 

3) There is some 77 > and an integer n such that Ry{X; a + r],n) < n. 

Proof. If X is not a-finitely representable in Y then there is a finite 
dimensional linear subspace Z oi X whose Banach-Mazur distance from any 
subspace of Y is greater than a. As in the proof of Corollary 2.9, a combination 
of a compactness argument and a differentiation argument imply that there is a 
finite subset S oi X which does not (a + 2r]) embed in Y for some r/ > 0. Since 
the subsets of X are nearly closed under composition, by applying Lemma 2.11, 
we deduce the implication 1) =^ 2). 
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The implication 2) =^ 3) is obvious, so we turn to show 3) =^ 1). Let 
A C X he a. finite subset that does not a + r] embed in Y, and let Z be A's 
linear span. Clearly d{Z, W) > a + ij for any linear subspace W of Y. It follows 
that X is not a-finitely representable in y. D 

Recall that a graph H is called a minor of a graph G ii H is obtained from 
G by a sequence of steps, each of which is either a contraction or a deletion of 
an edge. We say that a family J^ of graphs is minor-closed if it is closed under 
taking minors. The Wagner conjecture famously proved by Robertson and 
Seymour [51], states that for any nontrivial minor-closed family of graphs J-', 
there is a finite set of graphs, TC, such that G G .7-" if and only if no member 
of W is a minor of G. We say then that J-' is characterized by the list Ti. of 
forbidden minors. For example, planar graphs are precisely the graphs which 
do not have -ftTs^s or K^ as minors, and the set of all trees is precisely the set 
of all connected graphs with no K^ minor. 

There is a graph-theoretic counterpart to composition. Namely, let G = 
{V, E) be a graph, and suppose that to every vertex x € V corresponds a graph 
Hx = (Vx, Ex) with a marked vertex r^ G Vx, where the Hx are disjoint. The 
corresponding graph composition, denoted G[{Hx}x£v]i is a graph with vertex 
set Uxt^v^x, and edge set: 

E = {[u,v]; x£V, [u,v]£Ex}U{[rx,ry]; [x,y]£E}. 

The composition closure of a family of graphs J-' can be defined similarly to 
Definition 2.2, and family J^ is said to be closed under composition if it equals 
its closure. 

Recall that a connected graph G is called bi-connected if it stays connected 
after we delete any single vertex from G (and erase all the edges incident with 
it). The maximal bi-connected subgraphs of G are called its blocks. 

We make the following elementary graph-theoretic observation: 

Proposition 2.14. Let H be a bi-connected graph {with > 3 vertices) 
that is a minor of a graph G. Then H is a minor of a block of G. 

Proof. Consider a sequence of steps in which edges in G are being shrunk 
to form H. If there are two distinct blocks Bi,B2 in G that are not shrunk to 
a single vertex, then the resulting graph is not bi-connected. Indeed, there is 
a cut- vertex a in G that separates Bi from B2, and the vertex into which a is 
shrunk still separates the shrunk versions of Bi,B2. This observation means 
that in shrinking G to H, only a single block B oi G retains more than one 
vertex. But then H \s a, minor of B, as claimed. D 

In the graph composition described above, each vertex r^; G Kc is a cut 
vertex. Consequently, each block of the composition is either a block of G (the 
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subgraph induced by the vertices {rx;x £ V} is isomorphic with G) or of one 
of the Hx (that is isomorphic with the subgraph induced on Vx). We conclude: 

Proposition 2.15. Let J- he a minor- closed family of graphs character- 
ized by a list of bi- connected forbidden minors. Then T is closed under graph 
composition. 

Let J- again be a family of undirected graphs. A metric space M is said 
to be supported on J^ if there exist a graph G £ J-' and positive weights on the 
edges of G such that M is the geodetic, or shortest path metric on a subset of 
the vertices of the weighted G. 

Here is the metric counterpart of Proposition 2.15: 

Proposition 2.16. Let T he a minor- closed family of graphs character- 
ized by a list of hi- connected forbidden minors. Then the class of metrics 
supported on T is nearly closed under composition. 

Proof. Fix some A > 1. Let J-'' be the class of metrics supported on J^. 
Let X £ comp^(.F') for some /3 > 1/2 to be determined later. We prove that 
X can be A-embedded in J^' . The proof is by induction on the number of steps 
taken in composing X from spaces in J^' . If X € J-'' there is nothing to prove. 

Otherwise, there exists a weighted graph G = (V, E, w) in J^. For sim- 
plicity, we identify G with a metric space in J^' , equipped with the geodetic 
metric defined by its weights. It is possible to express X as X = G/3[7Y'], where 
TC' = {H'^}zev such that each of the metric spaces H'^ is in compg{J^'). By in- 
duction we assume that there exists /3 for which each H'^ can be 
A-embedded in J^' . Therefore there exists a family of disjoint weighted graphs 
{H^ = {Vz,Ez,uJz)}z£V: such that for every z £V, there is a noncontractive 
Lipschitz bijection, (p^ '■ H'z -^ Vz, satisfying for any u,v £ H'^, dH'{u,v) < 
dHMz{u)Az{v)) < XdH'^iu,v). 

Let Y = G[{Hz}zgv] be the graph composition of the above graphs. De- 
fine weights w' on the edges of Y as follows: For any z £ V, [u,v] £ Ez, 
let w'{[u,v]) = Wz{[u,v]). For [x,y] £ E, let w'{[rx,ry]) = (3^w{[x,y]), where 
7 = ™f^^'£v- '^™^^ '> (^as in the definition of metric composition). For simplicity, 

we identify Y with the weighted graph defined above as well as the geodetic 
metric defined by this graph. The proof shows that if f3 is large enough, 
then the geodetic metric on the graph composition Y is A-equivalent (and thus 
arbitrarily close) to the metric /3-composition X. Proposition 2.15 implies that 
Y belongs to J^' , which proves the claim. 

Indeed, define the bijection cp : X ^ LJ„g\/Vu as follows: for z £ V, ii 
u £ H'z, then (p{u) = (j)z{u). The geodetic path between any two vertices 
u',v' £ Vz is exactly the same path as in Hz, since the cost of every step 
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outside of Vz exceeds diain{Hz) (by definition of 7). This implies that 

dx{u,v) = dH'^{u,v) <dHMz{u),(t)z{v)) 

= dy {(piu) , (p{v)) < XdH'^{u,v) = Xdx{u,v). 

Also, the distance in the graph composition between u' & Vx and v' £ Vy 
with X ^ y G V,isat most l3^dG{x,y)+2Xm.aXz diam(i72) < j{l3+2X)dG{x,y). 
It follows that for u G H'^ and v G H' 

dx{u,v) = f3-fdG{x,y)<dY{(l){u),(t>{v)) 

<jiP + 2X)dGix,y)=(^(^^^dxiu,v). 

Hence if /3 > ^, we have, dist((/>) < max |a, ^^| = A. D 

Recall that a Banach space X is called super-reflexive if it admits an 
equivalent uniformly convex norm. A finite-metric characterization of such 
spaces was found by Bourgain [16]. Namely, X is super reflexive if and only if 
for every q > there is an integer h such that the complete binary tree of depth 
h doesn't a-embed into X. Let TREE denote the set of metrics supported on 
trees. Since any weighted tree is almost isometric to a subset of a deep enough 
complete binary tree, we conclude using Lemma 2.11. 

Corollary 2.17. Let X be a Banach space. Then the following asser- 
tions are equivalent: 

a) X is super-reflexive. 

b) For any a > 1 there exists 5 < 1 such that for infinitely many integers n, 

RxiTREE;a,n) <n^. 

c) For any a > 1 there exists an integer n such that 

Rx{TREE;a,n) < n. 

3. Metric Ramsey-type theorems 

In this section we prove Theorem 1.3; i.e., we give an n^^^' lower bound 
on R2{a,n) for a > 2. 

The proof actually establishes a lower bound on RjjMict,n). The bound 
on i?2 follows since ultrametrics embed isometrically in £2. The lower bound 
for embedding into ultrametrics utilizes their representation as hierarchically 
well-separated trees. We begin with some preliminary background on ultramet- 
rics and hierarchically well-separated trees in Section 3.1. We also note that 
our proof of the lower bound makes substantial use of the notions of metric 
composition and composition closure which were introduced in Section 2. 
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We begin with a description of the lemmas on which the proof of the lower 
bound is based and the way they are put together to prove the main theorem. 
This is done in Section 3.2. Detailed proofs of the main lemmas appear in 
Sections 3.3-3.6. Most of the proof is devoted to the case where a is a fixed, 
large enough constant. In Section 3.7, we extend the proof to apply for every 
a > 2. 

3.1. Ultrametrics and hierarchically well- separated trees. Recall that an 
ultrametric is a metric space {X, d) such that for every x,y, z G X, 

d{x,z) < Taax{d{x,y),d{y, z)}. 

A more restricted class of metrics with an inherently hierarchical structure 
plays a key role in the sequel. Such spaces have already figured prominently 
in earlier work on embedding into ultrametric spaces [3], [5]. 

Definition 3.1 ([3]). For A; > 1, a k- hierarchically well-separated tree 
(fc-HST) is a metric space whose elements are the leaves of a rooted tree T. 
To each vertex u £ T there is associated a label A(u) > such that A{u) = 
if and only if n is a leaf of T. It is required that if a vertex w is a child of a 
vertex v then A(u) < A{v)/k . The distance between two leaves x,y £ T is 
defined as A{lca{x,y)), where lca{x,y) is the least common ancestor of x and 
y in T. 

A A;-HST is said to be exact if A{u) = A(v)/k for every two internal 
vertices where u is a child of v. 

First, note that an ultrametric on a finite set and a (finite) 1-HST are 
identical concepts. Any fc-HST is also a 1-HST, i.e., an ultrametric. However, 
when fc > 1, a fc-HST is a stronger notion which has a hierarchically clustered 
structure. More precisely, a fc-HST with diameter D decomposes into subspaces 
of diameter at most D/k and any two points at distinct subspaces are at 
distance exactly D. Recursively, each subspace is itself a fc-HST. It is this 
hierarchical decomposition that makes fc-HST's useful. 

When we discuss A;-HST's, we freely use the tree T as in Definition 3.1, 
the tree defining the HST. An internal vertex in T with out-degree 1 is said 
to be degenerate. If u is nondegenerate, then A(u) is the diameter of the sub- 
space induced on the subtree rooted by u. Degenerate nodes do not influence 
the metric on T's leaves; hence we may assume that all internal nodes are 
nondegenerate (note that this assumption need not hold for exact /c-HST's). 

We need some more notation: 

Notation 3.2. Let UM denote the class of ultrametrics, and fc-HST denote 
the class of fc-HST's. Also let EQ denote the class of equilateral spaces. 
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The following simple observation is not required for the proof, but may 
help direct the reader's intuition. More complex connections between these 
concepts do play an important role in the proof. 

Proposition 3.3. The class of fc-HST's is the k- composition closure of 
the class of equilateral spaces; i.e., /c-HST = comp^(EQ). 

In particular, the class of ultrametrics is the 1- composition closure of the 
class of equilateral spaces; i.e., UM = comp;^(EQ). 

We recall the following well known fact (e.g. [34]), that allows us to reduce 
the Euclidean Ramsey problem to the problem of embedding into ultrametrics: 

Proposition 3.4. Any ultrametric is isometrically embeddable in £2- In 
particular, 

R2{a,n) > R-[ju{a,n). 

This proposition can be proved by induction on the structure of the tree 
defining the ultrametric. It is shown inductively that each rooted subtree 
embeds isometrically into a sphere with radius proportional to the subtree's 
diameter, and that any two subtrees rooted at an internal vertex are mapped 
into orthogonal subspaces. 

When considering Lipschitz embeddings, the /c-HST representation of an 
ultrametric comes naturally into play. This is expressed by the following vari- 
ant on a proposition from [4]: 

Lemma 3.5. For any k > 1, any ultrametric is k- equivalent to an exact 
k-BST. 

Lemma 3.5 is proved via a simple transformation of the tree defining the 
ultrametric. This is done by coalescing consecutive internal vertices, whose 
labels differ by a factor which is less than k. The complete proof of Lemma 3.5 
appears in Section 3.5 

We end this section with a proposition on embeddings into ultrametrics, 
which is implicit in [3]. Although this proposition is not used in the proofs, it 
is useful for obtaining efficient algorithms from these theorems. 

Lemma 3.6. Every n-point metric space is n-equivalent to an ultrametric. 

Proof. Let M be an n-point metric space. We inductively construct an 
n-point HST X with diam(X) = diam(M) and a noncontracting n-Lipschitz 
bijection between M and X. 

Define a graph with vertex set M in which [u, v] is an edge if and only 
if dM{u,v) < -^^^ — -■ Clearly, this graph is disconnected. Let Ai, . . . ,Am 
be the vertex sets of the connected components. By induction there are 
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HST's Xi, . . . ,Xm with diam(Xj) = diam((Aj, ^m)) < diaiii(M) and bijec- 
tions fi'.Ai^ Xi such that for every u,v £ Ai, dM{u,v) < dxi{fi{u), fi{v)) < 
\Ai\dMiu,v) < ndM{u,v). Let Tj be the tree defining Xi. We now con- 
struct the HST X whose defining labelled tree T is rooted at z. The root's 
label is A{z) = diani(M) and it has m children, where the ith child, Ui, 
is a root of a labelled tree isomorphic to Tj. Since A{ui) = diani(Xj) < 
diam(M) = diani(X) = A{z), the resulting tree T indeed defines an HST. 
Finally, ii u £ Ai and v € Aj for i ^ j then d]w(u, f) > diam(M)/n. Since 
diam(X) = A(z) = diani(M), the inductive hypothesis implies the existence 
of the required bijection. D 

3.2. An overview of the proof of Theorem 1.3. In this section we describe 
the proof of the following theorem: 

Theorem 3.7. There exists an absolute constant C > such that for 
every a > 2, 

RvMia,n) > n^"^^ . 

By Proposition 3.4, the same bound holds true for R2{a,n). 

We begin with an informal description and motivation. The main lemmas 
needed for the proof are stated, and it is shown how they imply the theorem. 
Detailed proofs for most of these lemmas appear in subsequent subsections. 

Our goal is to show that for any a > 2, every n point metric space X 
contains a subspace which is a-equivalent to an ultrametric of cardinality > 
n^(")^ where ip{a) is independent of n. In much of the proof we pursue an even 
more illusive goal. We seek large subsets that embed even into fc-HST's (recall 
that this is a restricted class of ultrametrics). A conceptual advantage of this 
is that it directs us towards seeking hierarchical substructures within the given 
metric space. Such structures can be described as the composition closure of 
some class of metric spaces Ai. A metric space in compo(A4) is composed of a 
hierarchy of dilated copies of metric spaces from Ai , and the proof iteratively 
finds such large structures. The class Ai varies from iteration to iteration, 
gradually becoming more restricted, and getting closer to the class EQ. When 
A4 is approximately EQ this procedure amounts to finding a /c-HST (due to 
Proposition 3.3). It is therefore worthwhile to consider a special case of the 
general problem, where X £ compo(A^), and we seek a subspace of X that is 
a-equivalent to a /c-HST. 

It stands to reason that if spaces in Ai have large Ramsey numbers, then 
something similar should hold true also for spaces in comp^(A^). After all, if 
f3 is large, then the copies of dilated metric spaces from Ai are hierarchically 
well-separated. This would have reduced the problem of estimating Ramsey 
numbers for spaces in comp/3(A^) to the same problem for the class Ai. 
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While this argument is not quite true, a shght modification of it does 
indeed work. For the purpose of this intuitive discussion, it is convenient to 
think of f3 as large, in particular with respect to k and a. Consider that X is 
the /3-composition of M G A4 and a set of \M\ disjoint metric spaces {Ni}iQM, 
Ni G comp^(A^). Assume (inductively) that each Ni contains a subspace N^ 
that is a-equivalent to a fc-HST Hi of size |A^j|'^. Find a subspace M' of M 
that is also a-equivalent to a fc-HST K and attach the roots of the appropriate 
Hi''s to the corresponding leaves of K (with an appropriate dilation). This 
yields a /c-HST H, and by the separation property of compositions with large 
/?, we obtain a subspace X' of X which is a-equivalent to H. However, the size 
of the final subspace X' = Ujgjv/'-^i depends not only on the size of M' , the 
subspace we find in M, but also on how large the chosen A^^'s are. Therefore, 
the correct requirement is that M' satisfies: 

This gives rise to the following definition: 

Definition 3.8 (The weighted Ramsey function). Let Ai,J\f he classes of 
metric spaces. Denote by V'x(AA, a) the largest < V' < 1 such that for every 
metric space X £ J\f and any weight function w : X ^ R^, there is a subspace 
y of A that a-embeds in M and satisfies: 

^^ 

(*) E ^(^)^ ^ ( E 

When M is the class of all metric spaces, it is omitted from the notation. 

In what follows the notion of a weighted metric space refers to a pair 
{X, w), where X is a metric space and w : X ^ IR+ is a weight function. 

The following is an immediate consequence of Definition 3.8 (by using the 
constant weight function w^x) = 1). 

Proposition 3.9. 

i?A4(AA;a,n)>n^^(^'"). 

In particular, 

RMici,n) >n^^("). 

We note that it is possible to show, via the results of Section 2, that in 
the setting of embedding into composition classes, and in particular in our case 
of /c-HST's or ultrametrics, the last inequality in Proposition 3.9 holds with 
equality for infinitely many n's. 
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The entire proof is thus dedicated to bounding the weighted Ramsey func- 
tion when the target metric class is the class of ultrametrics. The proofs in 
the sequel produce embeddings into fc-HST's and ultrametrics. In this context, 
the following conventions for tpjn {J\f, a) are useful: 

• ipk{M,a) = V'fc-HST(AA,a). In particular, Vfe(a) = V'fc-HST(a)- 

• ip{M,a) = ipi{M,a) = '0um(AA, a). In particular, ip{a) = '0UM(a)- 

The following strengthening of Theorem 3.7 is the main result proved in 
this section. 

Theorem 3.7'. There exists an absolute constant C > such that for 
every a > 2, 

V^(a)>l-Ci^. 
a 

Our goal can now be rephrased as follows: given an arbitrary weighted 
metric space {X, w), find a subspace of X, satisfying the weighted Ramsey con- 
dition (*) with ipia) as in Theorem 3.7', that is a-equivalent to an ultrametric. 

Before continuing with the outline of the proof, we state a useful property 
of the weighted Ramsey function. When working with the regular Ramsey 
question it is natural to perform a procedure of the following form: first find a 
subspace which is ai-embedded in some "nice" class of metric spaces, then find 
a smaller subspace of this subspace which is 02 equivalent to our target class 
of metric spaces, thus obtaining overall aia2 distortion. If the first subspace 
has size n' > n"^^ and the second is of size n" > n"^^ then n" > n"^^"^^ . 

The weighted Ramsey problem has the same super-multiplicativity prop- 
erty: 

Lemma 3.10. Let A4,A^, "P be classes of metric spaces and ai,a2 > 1. 
Then 

ipM{'P,o:ia2) > iPmWiOii) ■ tpAfi'P,a2). 

The interpretation of this lemma (proved in §3.3) is as follows: Suppose 
that we are given a metric space in V and we seek a subspace that embeds 
with low distortion in M, and satisfies condition (*). We can first find a 
subspace which ai-embeds in M and then a subspace which a2-embeds in A4. 
In the course of this procedure we multiply the distortions and the V^'s of the 
corresponding classes. 

The discussion in the paragraph preceding Definition 3.8 on how Ramsey- 
type properties of class Ai carry over to comp^(7W), leads to the following 
proposition: If for every X G Ai and every w : X ^ M"^ there is a subspace 
Y G X, satisfying the weighted Ramsey condition (*) with parameter ^, which 
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is a-equivalent to a /c-HST, then the same holds true for every M £ comp^(A4). 
In our notation, we have the fohowing lemma (proved in §3.3): 

Lemma 3.11. Let M be a class of metric spaces. Let k > 1 and a > 1. 
Then for any (3 > ak, 

V'fc(comp^ (>!),«) ='4)k{M,a). 

In particular for (3 > a, 

^(comp^(A^),a) = ■i/;(A4,a). 

The following simple notion is used extensively in the sequel. 

Definition 3.12. The aspect ratio of a finite metric space M, is defined as: 
^,,^, diam(M) 

^M) = — — ; / . . 

mm^^ydM{x,y) 

When \M\ = 1 we use the convention <1>(M) = 1. We note that $(M) can 
be viewed as M's normalized diameter, or as its Lipschitz distance from an 
equilateral metric space. 

Again, it is helpful to consider the A;-HST representation of an ultramet- 
ric Y. In particular, notice that in this hierarchical representation, the number 
of levels is 0(log^$(y)). In view of this fact, it seems reasonable to expect 
that when ^{X) is small it would be easier to find a large subspace of X that 
is close to an ultrametric. This is, indeed, shown in Section 3.4. 

Definition 3.13. The class of all metric spaces M with aspect ratio $(M) 
< $, for some given parameter $, is denoted by J\f{^). Two more conventions 
that we use are: For every real $ > 1, 

• 'ip{^,a) = ilj{Af{^),a). Similarly Vfcl^,") = V'A:(Ar($),a), and in gen- 
eral where A4 is a class of metric spaces, ipMi^, «) = ipMi-^{^)i <^)- 

• comp^(<5) = comp^(AA(<I>)). 

The main idea in bounding ^(<5, a) is that the metric space can be decom- 
posed into a small number of subspaces, the number of which can be bounded 
by a function of <I>, such that we can find among these, subspaces that are 
far enough from each other and contain enough weight to satisfy the weighted 
Ramsey condition (*). Such a decomposition of the space yields the recursive 
construction of a hierarchically well-separated tree, or an ultrametric. This is 
done in the proof of the following lemma. A more detailed description of the 
ideas involved in this decomposition and the proof of the lemma can be found 
in Section 3.4. 
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Lemma 3.14. There exists an absolute constant C" > such that for every 
a>2 and $ > 1: 

a 

Note that for the class of metric spaces with aspect ratio <l> < exp(0(a)), 
Lemma 3.14 yields the bound stated in Theorem 3.7'. 

Combining Lemma 3.14 with Lemma 3.11 gives an immediate consequence 
on /3-composition classes: for f3 > a, 

(1) ^(comp,($),a) =^($,a) > i _ c'i^i^±l^M4^. 

We now pass to a more detailed description of the proof of Theorem 3.7'. 
Let X be a metric space and assume that for some specific value of a we can 
prove the bound in the theorem (e.g., this trivially holds for a = ^{X) where 
we have ^p{X, a) = 1). 

Let X be an arbitrary metric space and let X be a subspace of X that is 
a-equivalent to an ultrametric, satisfying the weighted Ramsey condition (*) 
with ip = ilj{X,a). We will apply the following "distortion refinement" pro- 
cedure: find a subspace of X that is (a/2)-equivalent to an ultrametric, sat- 
isfying condition (*) with ip' > {1 — C'J^EJiy This implies that ^l^{X,a/2) > 
(1 — C"-^^^^)ip{X, a). Theorem 3.7' now follows: we start with a = ^{X) and 
then apply the above distortion refinement procedure iteratively until we reach 
a distortion below our target. It is easy to verify that this implies the bound 
stated in the theorem. 

The distortion refinement uses the bound in (1) on ^(comp^ (<!)), a"), in 
the particular case a" < a/2 and $ < exp(0(Q)). This is useful due the 
following claim: if X is a-equivalent to an ultrametric then it contains a sub- 
space X' which is (1 + 2//3)-equivalent to a metric space Z in comp^(<l>), for 
<1> < exp(0(a)), and which satisfies condition (*) with ip" > (1 — 2-^^Sii)^_ gy 
(1) we obtain a subspace Z' of Z which is a"-equivalent to an ultrametric. By 
appropriately choosing all the parameters, we see from Lemma 3.10 that there 
is a subspace X" of X' which is (a/2)-equivalent to an ultrametric, and the 
desired bound on ■ilj{X,a/2) is achieved. 

The proof of the above claim is based on two lemmas relating ultrametrics, 
/c-HST's and metric compositions. Let X be a-equivalent to an ultrametric Y. 
The subspace X' is produced via a Ramsey-type result for ultrametrics which 
states that every ultrametric Y contains a subspace Y' which is a'-equivalent 
to a /c-HST with k > a' . (Lemma 3.5 can be viewed as a non-Ramsey result of 
this type when k = a'.) Moreover, we can ensure that condition (*) is satisfied 
for the pair Y' gY with the bound stated below. 
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Lemma 3.15. For every k> a> 1, 

log a 

The proof of this lemma involves an argument on general tree structures 
described in Section 3.5. 

Now, by Lemma 3.10 we obtain a subspace X' that is a'a-equivalent to a 
/c-HST for k > a' . If fc is large enough then the subtrees of the fc-HST impose 
a clustering of X' . That is, each subtree corresponds to a subspace of X' of 
very small diameter, whereas the a distortion implies that the aspect ratio of 
the metric reflected by inter-cluster distances is bounded by a. By a recursive 
application of this procedure we obtain a metric space in comp^(a), with the 
exact relation between k,a, and f3 stated in the lemma below. The details of 
this construction are given in Section 3.6. 

Lemma 3.16. For any a, f3 > 1, if a metric space M is a-equivalent to 
an a/3-HST then M is (1 + 2/ (3)- equivalent to a metric space in comp^(a). 

The distortion refinement process described above is formally stated in 
the following lemma: 

Lemma 3.17. There exists an absolute constant C" > such that for 
every metric space X and any a > 8, 



^(l,f)>^(l,«)(l-C"i^) 



Proof. Fix a weight function w : X ^ R'^ , let X be a subspace of X 
that is a equivalent to an ultrametric Y, and satisfies the weighted Ramsey 
condition (*) with ip{X,a). Fix two numbers a',/3 > 1 which will be de- 
termined later, and set k = aa'(3. Lemma 3.15 implies that Y contains a 
subspace Y' which is a'-equivalent to a fc-HST, and Y' satisfies condition (*) 

with ^fc(UM,a') > 1 °^^ '^, ' . By mapping X into an ultrametric F, and 

then mapping the image of X in y into a fc-HST, we apply Lemma 3.10, obtain- 
ing a subspace X' of X that is a' a-equivalent to a fc-HST VF, which satisfies 
condition (*) with exponent Vfc(UM,a') • ^{X,a) > (l - ^^^f^) V'(^,a)- 
Denote <& = ol a. We have that X' is ^-equivalent to a <I>/3-HST and therefore 
by Lemma 3.16, X' is (1 + 2//3) equivalent to a metric space Z in comp^($). 
Now, we can use the bound in (1) to find a subspace Z' of Z that is /3-equivalent 
to an ultrametric, and satisfies condition (*) with exponent Vlcompa ($),/?). 
By mapping X' into Z G comp^($) and finally to an ultrametric, we apply 
Lemma 3.10 again, obtaining a subspace X" of X that is /3(1 -|- 2//?) = /? -|- 2 
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equivalent to an ultranietric U, satisfying condition (*) with exponent 

(3 J \ loga' J 

Finally, if we choose /3 = a/2 — 2 and let <& = 2^" (which determines a'), 
we get that 

for an appropriate choice of C" D 

Theorem 3.7' is a straightforward consequence of Lemma 3.17: 

Proof of Theorem 3.7'. By an appropriate choice of C we may assume that 
a > 8. Let X be a metric space and set <1> = ^{X). Recall that il^{X, <I>) = 1. 
Let m = [logaj and M = [log<&]. Lemma 3.17 implies that ilj{X,a/2) > 
il^{X, a) — C"-2^, and so by an iterative application of this lemma we get 

M 

i;{X,a)>i;{X,2"^)>ij{X,2^)-C" J^ ^ 

i=m+l 



>1-C" y A = l-C"^^^±^>l-6C"i^. 



21 2™ ~ a 

1 



D 



This completes the overview of the proof of Theorem 3.7'. Sections 3.3-3.6 
contain the proofs of the lemmas described above. 

Additionally, in Section 3.7 we describe in detail how to achieve Ramsey- 
type theorems for arbitrary values of a > 2. The main ideas that make this 
possible are first, replacing Lemma 3.14 with another lemma that can handle 
distortions 2 + e and second, providing a more delicate application of our 
Lemmas, using the fact that we can find fc-HST's with large k (« 1/e) rather 
than just ultrametrics, to ensure that accumulated losses in the distortion are 
small. 

We end with a discussion on the algorithmic aspects of the metric Ramsey 
problem. Given a metric space X on n points, it is natural to ask wether we 
can find in polynomial time a subspace y of X with n"^ points which is a- 
equivalent to an ultrametric, for ip as in Theorem 3.7. It is easily checked 
that the proofs of our lemmas yield polynomial time algorithms to find the 
corresponding subspaces. Thus, the only obstacle in achieving a polynomial 
time algorithm, is the fact, that the proof of Theorem 3.7' involves 0(log<l>) 
iterations of an application of Lemma 3.17. We seek, however, a polynomial 
dependence only on n. This is remedied as follows: It is easily seen that using 
Lemma 3.6 we can start from -(/'(X, |X|) = 1 rather than 'ip{X,^{X)) = 1. 
Thus we replace the bound of $ with n, and end up with at most O(logn) 
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iterations of Lemma 3.17. This implies a polynomial time algorithm to solve 
the metric Ramsey problem. 

3.3. The weighted metric Ramsey problem, and its relation to metric com- 
position. In this section we prove Lemmas 3.11 and 3.10. We begin with 
Lemma 3.10, which allows us to move between different classes of metric spaces 
while working with the weighted Ramsey problem. 

Lemma 3.10. Let M..,J\f.,V he classes of metric spaces and Qi,a2 > 1- 
Then 

ipM{'Piaia2) > i^Mi-^iOi) ■ipAfi'P,a2)- 

Proof. Let Vi = '^Mi-^^'^i) and ^2 = '^N'^-,Oi2)- Take P £ V and 
a weight function w : P ^ M^. There are a subspace P' of P and an Q2- 
embedding f : P' ^ N, where N £ J\f, and 

xeP' \xeP / 

Similarly, for every weight function w' : N ^ M"'" there exists a subspace 
N' of N and an ai-embedding g : N' ^ M, where M £ Ai, and 

By letting P" = f'^{N'), and for yeN, w'{y) = w{f-''{y))'^\ we get that 

x£P' I VxGP / 

Define h : P" ^ M by h{x) = g{f{x)); then h is an Qia2-embedding. D 

Lemma 3.11 shows that the weighted Ramsey function stays unchanged 
as we pass from a class A4 of metric spaces to its composition closure. To 
repeat: 

Lemma 3.11. Let M be a class of metric spaces. Let k > 1 and a > 1. 
Then for any j3 > ak, 

tljk{compp{M),a) = t(jk{M,a). 

Proof. Since Ai C comp^(A^), clearly ipk{compp{M.),a) < tpk{-M.,ci)- In 
what follows we prove the reverse inequality. 
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Let ^l^ = '^pki■M.,a). Let XGconipo(7W). We prove that for any it;:X^M^ 
there exists a subspace Y oi X and a fe-HST H such that Y is a-equivalent to 
H via a noncontractive a-Lipschitz embedding, and: 



i, 



xeY \xex 



The proof is by structural induction on the metric composition, li X £ M 
then this holds by definition of ip. Otherwise, let M G A^ and M = {Nz}zeM 
be such that X = M/s[M]. 

By induction, for each z £ M, there is a subspace Y^ of A^^^ that is 
a-equivalent to a fc-HST Hz, defined by the tree T^, via a noncontractive 
a-Lipschitz embedding, and 






V 



For a point z £ M let w'{z) = ^„g^ w{u). There exists a subspace Ym 
of M that is a-equivalent to a A;-HST Hm, defined by Tm, via a noncontractive 
a-Lipschitz embedding, and 



z£Ym VzeM / \xex 





Let Y = UzeYM^z- It follows that 

5^ w{xf =Y.H ^(^)^ ^ 

We now construct a fc-HST i7 that is defined by a tree T, as follows. Start 
with a tree T' that is isomorphic to Tm and has labels A(n) = /37 • Ax'j,,j(u) 
(where 7 = ™'^^'g" '|^™^ ^> g^g ^j^ Definition 2.1). At each leaf of the tree 
corresponding to a point z G M, create a labelled subtree rooted at z that is 
isomorphic to Tz with labels as in Tz- Denote the resulting tree by T. Since 
we have a noncontractive a-embedding of Yz in Hz, it follows that A(z) = 
diam(ii^2) < a diam(Y2) < a diam(A^2). Let p be a parent of z in Tm- Since we 
have a noncontractive a-embedding of Ym into Hm, it follows that ^Tm'kV) ^ 
dM{x, y) for some x,y £ Ym- Therefore A{p) > (3^-Tam{dM{x, y);x ^ y £ M}. 
Consequently, /S.{p) / /S.{z) > (3 /a > k. Since Hm and Hz are fe-HST's, it follows 
that T also defines a A;-HST. 

It is left to show that Y is a-equivalent to H. Recall that for each z £ M 
there is a noncontractive Lipschitz bijection fz'-Yz^ Hz that satisfies for 
every u,v £Yz, dYSu,v) < dn^ifziu), fz{v)) < ady^(u,-u). Define f : Y ^ H 
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as follows. If z £ M and u £ Nz, set f{u) = fz{u). Then for u,v £ M such 
that u,v £ Nz we have 

dYiu,v)=dY,iu,v) < dHAfz{u)Jz{v)) 

= dH{f{u)J{v)) < adySujv) = adyiu^v). 

Additionally, we have a noncontractive Lipschitz bijection /m '■ Ym -^ 
Hm that satisfies for every x,y £ Ym, (iy^„(x,y) < dHM{fM{x)jM{y)) < 
advM (x, y)- Hence for x / y G M, and u £ N^, y £ Ny, 

dviu, v) = /3-fdYM {x, y) < p-fdH^ {fM{u), fM{v)) 

= dH{f{u),f{v)) < a(3-fdYM{x,y) = adY{u,v). n 

3.4. Exploiting metrics with hounded aspect ratio. In this section we prove 
Lemma 3.14 (§3.2). That is, we give lower bounds on ^ = ^($,a) which 
depend on the aspect ratio of the metric space, $. 

The proof of the lemma starts by obtaining lower bounds for a restricted 
class of weight functions w. These bounds are then extended to general weights. 
The class of "nice" weight functions is itself divided into two classes. In one 
class we have a lower bound on the minimal weight relative to the total overall 
weight, and the other is the constant weight function. This is formally defined 
as follows: 

Definition 3.18. Fix some g > 1. A sequence x = {xj}^^ of nonnegative 
real numbers will be called q- decomposable if there exists w > such that: 

{i G N; Xi > 0} = < i G N; Xj > - ^Xj > [J{i G N; x, = uj}. 

We will prove the following lemma: 

Lemma 3.19. Let q >2, and t > 8 be an integer. Let (M, d) be an n-point 
metric space and let w : M ^ M"*", be a weight function such that {■w{x)}x<^m 
is q- decomposable. Then there exists a subspace N C M that is it- equivalent 
to an ultrametric and satisfies: 



V 



where ip = [tlog(4g$(M))] 




w{x) 



The proof of Lemma 3.19 uses a decomposition of the metric space M 
into a small number of subspaces. This type of strategy has been used in 
several earlier papers in combinatorics and theoretical computer science, but 
the argument closest in spirit to ours is in [5]. The idea is to consider two 
diametrical points, split the space into shells according to the distance from one 
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of these two points, and discard the points in one of the shehs. Intuitively, we 
would like to discard a shell with small weight. The exact choice is somewhat 
more sophisticated, tailored to ensure the weighted Ramsey condition (*). The 
other shells form two subsets of the space that are substantially separated. By 
an appropriate choice of the parameters, we can guarantee that the union of 
the inner layers has diameter smaller than a constant factor of the diameter of 
the whole space, and hence a smaller aspect ratio. The role of g-decomposable 
weights is as follows: This argument works fairly well for uniform weights, 
and a slight modification of it yields bounds as a function of q (in addition 
to $) when in the weighted case only a few points carry each at least - of the 
total weight. Here the argument splits according to the diameter of the set 
of "heavy" points. If the diameter is small, the previous argument is started 
from a point that resides far away from the heavy points. This guarantees that 
none of the "heavyweights" get eliminated in the above-described process. If 
their diameter is proportional to that of the whole space, it is possible to argue 
similarly to the uniform-weight case, except that we now obtain better bounds, 
since we can make estimates in terms of q (rather than the cardinality of the 
space n). 

The extension of Lemma 3.19 to arbitrary weight functions requires a 
lemma on numerical sequences. This lemma allows us to reduce the case of 
general sequences of weights to g-decomposable ones. 

Lemma 3.20. Fixq > 16 and let x = {xi}'^-^ be a sequence of nonnegative 
real numbers. Denote p = 1 — ~f§F^T^- There exists a sequence y = {yiji^i 

such that yi < Xi for all i > 1, X]j>i ^f — (Si>i^O i ^'^'^ ^^^ sequence 
{yfli^i ^-5 q- decomposable. 

Together these lemmas imply our main lemma: 
Lemma 3.14. For every a > 2 and every $ > 1: 

K^ N^i „ log a + log log 4$ 

^(q),a) > 1 — G , 

a 

where C is a universal constant. 

Proof. Clearly we may assume that a > 32. Let X be a metric space with 
^{X) < $, and w : X ^ M"*" a weight function. Set t = [a/4j. By applying 
Lemma 3.20 to the sequence {w{x)}x<^x with q = 2*, we obtain a weight 
function w' such that w'{x) < w{x) for all x £ X, the sequence {uj'{xy}x£X 
is (/-decomposable, and 



E 

x&X \x&X 



y ^ w' {xY > y^ w{x) 



where p = 1 — —, 
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Let (3 = [tlog(4g<I>(X))]~^/* and apply Lemma 3.19 to the space X and 
weights w" = w'P. We obtain a subspace Y which is 4t-equivalent to an 
ultrametric, such that 

Therefore, 



x&Y 




W[X) 



i){^,a)>p(5> 1 



>l-C 



log2i 



1 



41ogt 21oglog(4<5) 



t J \ t 

log a + loglog(4<5) 



a 



for an appropriate choice of C. 



D 



We now pass to the proof of Lemma 3.20. Let x = {xi}'^^ be a sequence 
of nonnegative real numbers which isn't identically zero. Let p > 0. Recall 
that the {p, oo) norm of x is defined by ||x||p^oo = supj>]^ i^'^x*, where {x*}'^^ 
is the nonincreasing rearrangement of the sequence (I^jD^^^. We will require 
the following numerical fact: 

Lemma 3.21. For every x £ ii as above and every < p < 1: 



\x\ 



P,OD 



> 



1 — P 

2-p 



i/p 



|i/p 



l^l|(i-p)/p' 

U^ oo 



Proof. We can assume without loss of generality that ||x||i = 1 and 
|2;||oo = xi > X2 > ■ ■ ■ > 0. Obviously ||x||p^oo ^ xi so that if xi > 
[(1 — p)/{2 — p)]^'^x-^ ^ we are done. Assume therefore that the reverse 



inequality holds, i.e., xi < j:^- Set a 



X 



P,OD 



and denote j = \a/x\] + 1. 



Note that for every i > 1, Xi < (a/i) '^. Therefore, 

i-i 



'^Xi< {j - l)xi < 



1=1 



a 



Xi < Xi 



a 



+ 1 



ax 



i-p 



+ xi, 



and 






z 



'^/^dz 



1 — p 



a 



i-i 

i-p 



1 — p 



a 



p 



1 — p 



-ax 



i-p 



By summing both inequalities and using the bound on xi we get 

1 i-p 1-P ^ 1 
QXi H > 1, 

1-p ^ 2-p- ' 

which simplifies to give the required result. 



D 



ON METRIC RAMSEY- TYPE PHENOMENA 673 

Proof of Lemma 3.20. We may assume that x is a nonincreasing sequence 
of nonnegative real numbers and that ||a;||i = 1. 

We will prove below that there exist indexes < I < b such that xf > - 

i q 

and: 

I 
(2) s=Y,<+{b-l)xl>l. 

i=l 

If 6 = / assume that I is the minimal index for which (2) holds. It follows 
that in this case S = Ylii=i a;f < 1 + x^ < 2. Similarly if 6 > ^, fix / and 
assume that b is the minimal index for which (2) holds. It follows that S = 

EU< + (ft- 0<<ELi< + (b-i-O^Li + <<! + << 2. 

Define the sequence {yi}^i so that yi = Xi for i < I, yi = Xb ior I < i < b 
and yi = for i > b. It follows that yi < Xi for all i > 1 and ^j>]^ yf = S > 
1 = (E^>i^^F- Since for i < / yj = x^ > f > I = | E.>i 2/f , fo"r K J < b, 
yf = x^ and for j > b, yf = 0, we get that {yf}^i is (/-decomposable. 

It remains to prove (2). Let / > be the largest integer for which 
xf > ^. If ^j=i a;f > 1 we are done. Otherwise, consider the sequence 
z = (x^+i, Xi+2, . . . ). By the choice of /, for i > I, Xi < {2/q)^'P, and thus 

INIloo < {2/qy/r Moreover, j^z^^^ > ^^^^^ • [^f'''^^' > 1, so by 
applying Lemma 3.21 to z we get that ||z||p,oo/||^;||i > 1; i.e., there is an integer 
b > I such that: 

I I 

ib-l)xl>\\z\\^ = l-Y,Xi>'^-Y.''^^■ ° 

We are now in position to prove the main technical lemma. 

Proof of Lemma 3.19. For simplicity denote /3(<1>) = [tlog2(4g<I>)]~^'*. We 
will prove by induction on n that any n point weighted metric space (M, d, w) 
contains a subspace N C M such that: 

/ s /3(<I>(M)) 

^ u;(x)'3(*W) > YI ^(^) 

xGN \xGM / 

and for which there is a noncontractive, 4t-Lipschitz embedding of N into an 
ultrametric H with diaui{H) = diam(M). 

In what follows for every 5 C M we denote w{S) = X^a;e5'"^(^)- 
Let (M, d, w) be a weighted n-point metric space such that w is g-de- 
composable. Without loss of generality we may assume that w{M) = 1 and 
minix^y^M d{x,y) = 1. Denote <I> = <5(M). The latter assumption implies that 
$ = diam(M). In what follows we denote for r > and x G M, B{x,r) = 
{y G M; d{y,x) < r}. The proof proceeds by proving the following claim: 
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Claim 3.22. There exist i G {l,...,t} and xq G M such that if A 
{xo} U B (xo, ^^^) andB = M\B (xq, f ) then: 



Before proving Claim 3.22 we will show how it implies the required result. 
Let i, A, B be as in Claim 3.22. Note that ^ / and diam(^) < | < diam(M). 
In particular it follows that, \A\, \B\ < n so that by the induction hypothesis 
there are subspaces A' C A and B' C B such that 

^^(^)/3(*(A))>^(^)/3(<E.{A)) 

x&A' 
and 

x&B' 

HST's X and Y with diam(X) = diam(^), diam(y) = diam(i?), and noncon- 
tractive embeddings f : A' ^ X^ g : B' ^ Y which are 4t-Lipschitz. Let T 
be the tree defining X and u be its root. Let S be the tree defining Y and v 
be its root. Define a tree R as follows: its root is r and the only two subtrees 
emerging from it are isomorphic to T and S. Label the root of R by setting 
A(r) = diam(M), and leave the labels of T and S unchanged. Note that 

A(r) = diam(M) > max{diam(^),diam(i?)} 

= max{diam(X), diam(y)} = max{A(u), A(w)}, 

so that with these definitions the leaves oi R, X U Y, form an HST with 
diam(X u F) = $ = diam(M). Define h : A' U B' ^ X UY by /lU' = / and 
h\B' = g. U a e A' and b e B' then d{h{a),h{b)) = <1> > d{a,b). Hence h is 
noncontracting. Additionally: 

,. ,x ,/, N ,/ X '^i <^(i-l) $ dR(h(a),h(b)) 
d{a,b) > d{b,xo) - d{a,xo) >-- -^-^ = -= ^^ ^// ^ " , 

so that h is 4t-Lipschitz. 

Observe that since /3($) < P{^{A)) < (log2 g)"^/^*"^) and w{x) < 1 
(point- wise), 

Y, yj{x)f^''^ > E H^f^"^^^^ > w{A)f^''^^^^ > w{Af"^^'^r^'''-'\ 

xeA' xeA' 
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Moreover, since ^{A) < <&/2: 






maXj^eA if (y) 
maXj^gA w{y) 



/3($/2) 



We deduce that: 

so that by (3), 

^ w;(x)'3(*) > J]; w;(x)^(*) + w{B) > 1, 
xeA'uB' xeA' 

as required. 

Proof of Claim 3.22. The fact that w is (/-decomposable imphes that we 
can spht M = N1IJN2, so that w{x) > ^ for every x ^ Ni and there is w > 
such that w{x) = to for every x G N2. We distinguish between two cases: 

Case 1. diamM(-^i) > -j"- In this case there are xo,Xq G A''! such that 
d(xo,XQ) > |. It follows in particular that i3(xo,<&/4) n i?(xo,$/4) = so 
that by interchanging the roles of xq and x'q , if necessary, we may assume that 
w{B{xo,^/4:)) < ^^^ = i. Since xq G A^i, w{xo) > i. This implies that 
there exist i £ {1, . . . , t} such that 



since otherwise: 






4t 
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which is a contradiction. Fixing such an index i and defining A, B as in the 
statement of Claim 3.22 we get that: 



4t 



+ 



l-wj B( xo,— 



> 1, 



which proves (3). 



Case 2. diamAf (A^i) < -^. In this case take xq G M such that (i(xo, A^i) = 
maXa;gAf (i(x, A^i). We claim that this implies that A''ini?(xo, *J'/4) = 0. Indeed, 
otherwise it will follow that d{xQ,Ni) < <l>/4 so that by the choice of xq, for 
every x,y £ M, 

$ 

d{x,y) < d{x, Ni) + d{y, Ni) + dmm{Ni) < 2d(xo , iVi ) + - < ^, 

which is a contradiction. 

Set m = \N2\ and denote for i G {0, . . . , t}: 

_ |({xo}Ui3(xo,f))nJV2| 



m 



Note that since xq G ^^^2) ^ti ^ = eo < et < I. We claim that this implies that 
there is some i £ {1, ■ ■ ■ ,t} such that: 



(4) 



/3(*/2) /3($/2)-/3($) 



>£,;. 



Indeed, if we set a = log2(2g'I>) > 1, then if there is no such i we have for every 
iG{l,...,t}: 

(ta)2/' 



,.^(ta)2/* [t(„+l)]2/t 

1 1 

Denote b = ttiC")^''' [t(»+i)i^/* and c = (ta)^'*. The above inequality then 
becomes ej_i < {ei/ty. Iterating this t times we get: 



1 



m 



eo < 



< 



1 






Thus, 

but an application of the mean value theorem gives a contradiction, since: 



it'a' - 1) 



1 



1 



(ta)2A [t{a + 1)]2A 



> 



t^a^ 



>t 



2 tl+2A(Q+l)l+2/t 

\ 2 
1-2/t ' "^ 



, >83/4.i>i. 
a + 1/ ~ 4 ^ 
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Choose an index i £ {1, . . . ,t} satisfying (4) and let A,B be as in the 
statement of Claim 3.22 for this particular i. Observe that: 

B = M\B (xo, '-^\z)M\B (xQ, I) ^ ^^1, 



so that: 






= w'3(*)(e,_im)^(*/2) + w{N{) + (1 - ei)muj 

> {mujf^'^hi + w{Ni) + (1 - ei)mLO 

> mioei + w{Ni) + (1 - ei)mtj = w{M) = 1. 

This concludes the proof of Claim 3.22. D 

3.5. Passing from an ultrametric to a fe-HST. In what follows we show 
that every ultrametric contains large subsets which are embeddable in a fc-HST 
with distortion a < k. 

An unweighted version of the following result was proved in [5]. The 
bound for the weighted Ramsey function is a straightforward modification of 
the proof in [5]: 

Lemma 3.23 ([5]). For every k > a > 1, 

1 



V^fe(UM,a)> 



[loga ^1 



If k is large with respect to a then the bound of [5] provides a good 
approximation for ^^(UM, a). In fact, this is how Lemma 3.23 is used in 
Section 3.7. 

However, when k is close to a the bound in Lemma 3.23 is not good enough 
for proving our main theorem. We obtain bounds for this range of parameters 
in the following lemma (stated in §3.2 in slightly weaker form). 

Lemma 3.15. For every k > a > 1, 

V'fc(UM,a)>l 



[logfc/a a] ' 

Before proving Lemma 3.15 we require some lemmas concerning unweighted 
trees. 

Definition 3.24. Let /i > 1 be an integer and i G {0, . . . , /i — 1}. We say 
that a rooted tree T is {i, h)-periodicaUy sparse if for every I = i { mod h), 
every vertex at depth Mn T is degenerate. T is called h periodically sparse if 
there exists i £ {0, ... ,h — 1} for which T is (i, /i) -periodically sparse. 
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In what follows we always use the convention that a subtree T' of a rooted 
tree T is rooted at the root of T and that the leaves of T' are also leaves of T. 
We denote by Ivs(T) the leaves of T. 

Lemma 3.25. Fix an integer h > 1. Let T be a finite rooted tree. Then 
for any w : Ivs(T) -^ M^ there exists a subtree ofT, T', which is h periodically 
sparse and: 

v&vs{T') \ve\vs{T) 

The techniques we use in the proof of Lemma 3.25 are similar to those 
used in the proof of Lemma 3.23 in [5]. It can also be derived from a result 
in [11] concerning influences in multi-stage games. These facts are also closely 
related to an isoperimetric inequality of Loomis and Whitney [39]. 

Proof. For every i G {0,1, ... ,h — 1} let fi{T) be the maximum of 
X^t)eivs(T') 'w{v)~h~ over all the {i, /i)-periodically sparse subtrees, T' , of T. We 
will prove by induction on the maximal depth of T that: 

h-i ( ^ '-' 

i=0 \i;elvs(T) 

from which it will follow that maxo<j</i_i /i(T) > f^^giygm '"^('^) ) '* ' ^^ 
required. 

For a tree T of depth 0, consisting of a single node v, we have that fi{T) = 
w{v)~h~ and therefore 

llf^{T)>{w{v)'T^) =w{vf-\ 
i=0 

Assume that the maximal depth of T is at least 1, let r be the root of T and 
denote hj vi, . . . ,vi its children. Denote by Tj the subtree of T rooted at Vj. 
Observe that: 

fo{T) > max fh-i{Tj), 

and for i £ {I, . . . ,h — 1}: 

I 
f\{T) = Y,h-i{T,), 

By repeated application of Holder's inequality: 

I h-2 /h-2 I 

j=l i=0 \i=0 j=l 
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Therefore, by the induction hypothesis: 

h-l h-l I 

n h{T) > max A_i(T,) • J] E f^-^^^^ 

i=0 -"'- j=l j=l 



, h-l 

I /h—2 \ ft-i 



>maxA_i(T,).|E(n/^(^:') 



I /h—l \ h-i 



h-l 



j=l \i=0 



>(E E -w 

j,-=l„glvs(T,) 



/i-l 




D 



Before proving Lemma 3.15, we prove the following variant of a proposition 
from [4]: 

Lemma 3.5. For any k > 1, any ultrametric is k- equivalent to an exact 
k-BST. 

Proof. Let T be a labelled tree rooted at r. Define a new labelled tree 
T' as follows. Let u be a minimal depth vertex in T that has a child v for 
which A{u) ^kA{v). Let < i G N be defined via k' < ^ < k'+^. Relabel 
V by setting A'{v) = —^ > A{v), and replace the edge [u,v] by a path of 
length i whose labels decrease by a factor k at each step. Denote the tree 
thus obtained by T'. If we start out with an HST X with defining tree T, 
then the tree T' produced in this procedure defines a new HST. Iterating this 
construction as long as possible, we arrive at a tree T which defines an exact 
A;-HST. To prove that we have distorted the metric by a factor of at most k 
observe that by construction, for any x,y & X, lcay(a;, y) = lcaj,(x, y) and that 
for any w G T n T, At{v) < Af{v) < /cAt(w). D 

Proof of Lemma 3.15. Let h = [log^ /^ a] and let s = k^' . By Lemma 
3.5, X is s-equivalent to some exact s-HST Y via a noncontractive s-Lipschitz 
embedding. Let T be the tree defining Y . Lemma 3.25 yields a subtree S of 
T which is (i, /i)-periodically sparse for some i G {0, . . . , /i — 1}, such that 



^w{x) 



^w{g~\v))—> 

v<=s \xex / 

By attaching a path of length h — 1 — i to the root of S we may assume that 
S is {h — 1, /i)-periodically sparse. Similarly, by adding appropriate paths to 
the leaves of S we may assume that there is an integer m such that all the 
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leaves of S are at depth mh. Denote by r the root of S. We change the 
tree S as follows. For every integer < j < m delete all the vertices of S 
whose depth is in the interval [jh + 1, (j + l)h — 1] and connect every vertex 
of depth jh directly to all its descendants of depth (j + l)h. Denote the tree 
thus obtained by S' and denote by Y' the metric space induced by S' on Y. 
It is evident that Y' is an exact s -HST. We claim that Y' is s equivalent 
to a subspace of X via a noncontractive s Lipschitz embedding. Indeed, 
fix u, f G Y' and let w be their least common ancestor in S. If we denote by 
q the depth of ti; in S* then the key observation is that since S is {h — 1, h)- 
periodically sparse, q ^ {h — 1) (mod h). We can therefore write q = i + jh 
for some i G {0, . . . , /i — 2} and j > 0. If we denote by w' the least common 
ancestor of u, v in S' then by the construction, w' is in depth jh in S. Hence 
dY{u,v) = -rfjl: and dY'{u,v) = -^, so that: 

dY{u,v) < dY'{u,v) < s'^dY{u,v) < s ~^dy(u, v). 

This shows that Y' is s^~^ equivalent to Y via a noncontractive s'*^^ Lipschitz 

embedding. Since y is s equivalent to a subspace of X via a noncontractive s 

Lipschitz bijection we have that Y' is s^~^ equivalent to a subspace of X. 

Recall that s^ = k, and it remains to show that s^^^ < a. Indeed by our 

1 'i 

choice of /i, /i — 1 < log^^./^ a, or j—^ > log^{k/a). Therefore a''-i > k, and so 

s'^-^ = k~f^ <a. U 

3.6. Passing from a /c-HST to metric composition. In this section we prove 
that if a metric space is close to a /c-HST then it is very close to a metric space 
in the composition closure of a class of metric spaces with low aspect ratio. 

Lemma 3.16. For any a,(3 > 1, if a metric space L is a-equivalent to a 
/3a-HST then L is (1 + 2/ (3)- equivalent to a metric space in comp^(a). 

Proof. Let L be a metric space. Let k = f3a. Let X be a /c-HST such that 
there is an a Lipschitz noncontractive bijection f : L ^ X. Namely, for every 
x,y e L, dL{x,y) < dx {f (x) , f (y)) < adL{x,y). 

Let T be the tree defining X. For a vertex u £ T, let T„ be the subtree of 
T rooted at u. Let Xu (a subspace of X) denote the HST defined by T^ and 
Lu = f^^{Xu). Then diam(L„) < diam(Xu) = A(n). 

Our goal is to build a metric space Z £ comp^(a) along with a non- 
contractive Lipschitz bijection g : L ^ Z which satisfies for every x,y G L, 
dL{x,y) < dz{g{x),g{y)) < ( 1 -I- i j dL{x,y). We prove this by induction on 
the size of L. The inductive hypothesis needs to be further strengthened with 
the requirement that diam(Z) < diam(L) = A. 

Let r be the root of T, with A(r) = A. Let C denote the set of chil- 
dren of r. By induction, there exists for each child u £ C a metric space 
Nu £ comp^(a) and a noncontractive Lipschitz bijection g^ : L^ ^ N^ which 
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2' 



satisfies for every X, y G L„, dL^{x,y) < dN^{gu{x),gu{y)) < \l + pj dL^{x,y). 
Also diam(iVu) < diam(Lu) = A(n). 

Define a metric space M = (C, dpi) by setting for every u j/^ v £ C, 

duiu, v) = max{(iL(x, y); x e Lu,y £ L^}. 

Fix u^ V ^ C and x G Lu-,y G -^i,. Since dx (f (x) , f (y)) = A, we have that 
A/a < (iL(x, y) < A. It fohows that for every u ^ v ^ C and x G L^, y G L„, 

— < dL(x,y) < dM(u,v) < diam(L) = A. 
a 

Therefore $(M) < a and diani(M) < A. Also for every u,v,x,y as above, 
dL{x,y) < dM{u,v) < dL{x,y) + diam{Lu) + diam{Ly) 

= diix, y) + A(n) + A(t;) < ^^(x, y) + 2- 

Now, we let 

max„6c diam(iVu) , 1 A/a 

mmu^^eC-aMltt,-") 7 A/fc 

Define Z G comp^(a), by letting Z = Ma'[AA], where M = {Nu}u&c- Also 
define for every u £ C and x G X^, (7(x) = gu{x)- 

Let u,v £ C and x G Lm,?/ G L^,. When u = v the bound on the distor- 
tion of g follows from our induction hypothesis. For u ^ v, dz{gix),g{y)) = 
P'ldMiujv) = dMiu,v), which implies the required bound on the distortion 
of g, and the requirement diam(Z) < A. D 

3.7. Distortions arbitrarily close to 2. Our goal in this section is to prove 
the following theorem: 

Theorem 3.26. There is an absolute constant c > such that for any 
k > 1 and < e < 1, for any integer n: 



-Rfc-HST(2 + e, n) > n>°s<2'=/o . 
In particular, 



(2/0 , 



i?UM(2 + e,n) > n>°s 



By Proposition 3.4 the same bound holds for -^2(2 + e, n). 
As in the case of large a, we derive Theorem 3.26 from the following 
stronger claim. 
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Theorem 3.26'. There is an absolute constant c > such that for any 
k>l and < e < 1: 

V'fc(2 + e)> 



log(2fe/e) 



The proof of Theorem 3.26' uses most of the techniques developed for the 
case of large a. The basic idea is first to apply Theorem 3.7' to obtain some 
constant a' for which there is a constant bound on ip{a'), e.g. 1/2. So, our 
goal is to find another subspace for which we can improve the distortion from 
a' to 2 + e. Again, we would like to exploit metric spaces with low aspect 
ratio <&. For large a we could do this with $ bounded with respect to a. 
Since we started with some constant a' we can expect ^ to be constant as 
well. However, the bound of Lemma 3.14 does not apply for small values of a. 
Thus, our first step is to obtain meaningful lower bounds on 7/;fc(<&,2 + e) for 
every e > 0. This is done by giving a lower bound on V'eq(^) 2 + e), that is by 
finding a large equilateral subspace, which is a special case of a A;-HST. We can 
now apply Lemma 3.11 to get lower bounds on '(/'fc(comp^($), 2 + e). For large 
a we were able to extend such bounds by finding a subspace close to a /c-HST, 
and therefore very close to a metric space in comp^(<I)) via Lemma 3.16. In the 
present case, "very close" means distortion ~ 1 + e, which implies that k and /3 
must be in the range of 1/e. This is achieved by initially applying Lemma 3.23 
to get a bound on ipk{a'). 

We begin with a proof of the bound on '0fc($,2 + e), which is based on 
bounds on embedding into an equilateral space. We start with the following 
result: 

Lemma 3.27. Let a > 2, s > 2 be real numbers and t > 1 be an integer. 
Let M be an n point metric space. Then at least one of the following two 
conditions holds: 

(1) M contains a subspace N of size at least s that is a- equivalent to an 
equilateral space. 

(2) M contains a subspace N of size at least n/s^, such that diam(iV) < 
(a/2)^*diam(M). 

Proof. By induction on t. Suppose that M has no subspace of size s that 
is a equivalent to an equilateral space. For t = 1 let Nq = M. For t > 1 we 
get by the induction hypothesis there is a subspace Nt-i C M which contains 
at least n/s*~^ points and diam(A't_i) < (a;/2)~*^^ diam(M). 

Let {ci, . . . , Cr} be a maximal subset of Nf-i such that 

d{ci,Cj) > diam(A^i_i)/Q 

for i 7^ j. Since {ci,...Cj,} is a equivalent to an equilateral space, our as- 
sumption implies that r < s. Let Ci = Nt-i n B{ci,diani{Nt-i)/a). By the 
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maximality of r, U^^iCi = Nt-i, and so if we set Nt to be the largest Cj, we 
have that its cardinahty is at least |A't_i|/r > n/s^. Now: 

diam(7Vt) < diam{B{ci,diam{Nt-i)/a)) < | diam(iVt_i) < (|)*diam(M). 

D 

This implies a bound on the cardinality of a subspace that is a-equivalent 
to an equilateral space. 

Corollary 3.28. Fix a > 2 and an integer n > 4. Let M be a metric 
space of size n. Then, 



^^^^gyiog.,.*(M)l-^^,^,^^^^^^(^^^-.^ 



Proof. Apply Lemma 3.27 with t = logQ,/2^(^) and s = (n/2)^/*. 
We obtain a subspace N of M. All we have to do is verify that with these 
parameters the second condition in Lemma 3.27 cannot hold. Indeed, otherwise 
|A^| > n/s* = 2 so that diam(A^) > mmx^ydMix,y), and it follows that 
(q/2)* < <1>, which contradicts the choice of t. D 

We show next that Corollary 3.28 implies bounds for the weighted Ramsey 
problem, and so we can bound '0eq(*^) a) for any a > 2. Since an equilateral is 
in particular a fc-HST, we get a bound on V'fe(^) ct)- To obtain this we need to 
extend the bound in Corollary 3.28 to hold for the weighted Ramsey problem. 
To achieve this we make use of another lemma from [5], which is similar in 
flavor to Lemma 3.20: 

Lemma 3.29 ([5]). Let x = {xi}"^-^ be a sequence of nonnegative real 
numbers. Then there exists a sequence y = {yjji^i such that r/i < xi for all 
i > 1 and: 

Moreover, one of the following two cases holds true: 

(1) For all i>2, yi = 0. 

(2) There exists a; > such that for all i > 1 either yi = uj or yi = 0. 
Corollary 3.30. For any /c > 1, a > 2 and $ > 1, 



V'fc(^,a) > V'EQ(<5,a) > - 



loga/2 ^ 



-1 
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Proof. Let M be an n-point metric space with aspect ratio <I>(M) < <I>. Let 
w : M ^ M+ be a weight function normahzed so that J2x£M'^i^) ~ ^- ^PP^Y 
Lemma 3.29 to the sequence {'w{x)}x£M to obtain a sequence {'w'{x)}x(=m such 
that for all x G M, w'{x) < w{x). In addition, either (i) There are u,v £ M 
such that w{u)^'^ + w{vy'^ > w'{u)^'^ + tf'(t;)^'^ > 1, or there is a subset 
N C M such that for all x e N, w{x) > w'{x) = to > 0, and \N\uj'^/'^ > 1. In 
the first case the subset {u, v} is isometric to an equilateral space and we are 
done. In the second case, if |A^| < 4 then uj > 1/16. Hence, we can choose two 
points u',v' £ N such that w{u'y/^ + w(f')^/'' > 2a;^/'' > 1. Again, {u',v'} is 
isometric to an equilateral space. Otherwise, |A^| > 4 and by Corollary 3.28 
there is a subspace N' C N which is a-equivalent to an equilateral space and 
\N'\ > |iV|2N-/^*r\ Hence: 



E 

x£N' 



,,-. ,,. _^| 



tt;(x)4r'°s=/2*r >|iV|Ki°s«/2*r u;-^\^°s 



,/2 



(iiv|L^^/2)iri°sc/2*r' >i. n 



Proof of Theorem 3.26'. Theorem 3.7' implies in particular that there is 
a constant 9 for which ip{9 /2) > 1/2. In other words, given a metric space X, 
there exists a subspace X' of X which is (^/2)-equivalent to an ultrametric Y 
and satisfies the weighted Ramsey condition (*) with ij: = 1/2. 

Let (3 = Sk/e and k' = 6(3. It follows from Lemma 3.23 that Y contains a 
subspace Y' which is 2-equivalent to a /c'-HST and satisfies condition (*) with 
V'fc'lUM, 2) > [log A;']^-'^. By mapping X into an ultrametric Y and its image 
in Y into a fc'-HST, we can apply Lemma 3.10 to obtain a subspace X" of X 
that is {9/2) ■ 2 = ^-equivalent to a fc'-HST, and satisfies condition (*) with 

V'fc'W > Vfc'(UM,2) • V(e/2) > ^r,\n - 

2 1 log k' I 

Now, X" is ^-equivalent to a ^/3-HST and so Lemma 3.16 implies that it is 
(1 + 2//3)-equivalent to a metric space Z in comp^(^). Therefore 

1 



V'comp,(e)(l + 2//3)>^fc'W 



> 



2 [log A;']' 

Additionally, using Lemma 3.11 and the bound of Corollary 3.30, we have that 
there is a constant c' such that 

ijk (comp JO), 2 + ^)=ijJe, 2 + ^)> 



4/ "^ V ' 4/ - log6' 

It follows that Z contains a subspace Z' which is (2 + e/4)-equivalent to a 
A:-HST. 

By mapping X into Z G comp^(^), and then its image in Z into a 
fe-HST, we can apply Lemma 3.10 to obtain a subspace of X which is 
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(2 + e/4)(l + 2//3) < (2 + e)-equivalent to a /c-HST and which satisfies the 
weighted Ramsey condition (*) with 

M2 + e) > ^. (comp,(e), 2 + 1) • ^_p^(,) (l + |) > 2 log g [4^/^)1 ' 
which imphes the theorem by an appropriate choice of c. D 

4. Dimensionality based upper bounds 

In this section we prove some upper bounds on the Ramsey function of 
low dimensional spaces. In particular, these imply bounds on the Euclidean 
Ramsey function R2{a,n). In addition, these bounds show that the lower 
bounds for low dimensional ip spaces from Corollary 1.7 in the introduction 
are nearly tight. Our upper bounds on R2{a,n) for a < 2 improve the results 
of [17] by showing that for any a < 2, R2{a, n) < 2 log2 n + C{a). The bounds 
obtained on R2{a,n) for 2 < a < log n/ log log n, are also possibly tight. 

The proof technique we employ here originates from a counting argument 
by Bourgain [15] and later variants (see [42]). A different argument, based 
on geometric considerations, uses expander graphs. Expander graphs, in fact 
yield the best upper bound we have on Rp{a, n) for a > 2 and all p > 1. This 
is shown in Section 5. 

In this section we prove the following bounds: 

Theorem 4.1. Let X he an h- dimensional nornied space and n be an 
integer. Then 

• For any 1 < a < 2, Rx (a, n) < 2 log2 n + 2h log2 ( 2^ ) • 

• For any a > 2, Rx{a,n) < Cn^~'^''^hloga 

where c, C > are some absolute constants. 

Using the Johnson-Lindenstrauss dimension reduction Lemma [31] we 
derive the following bounds for the Euclidean Ramsey function R2{a,n). 

Corollary 4.2. There are absolute constants c, C > such that for 
every integer n, 

• For any < e < 1, i?2(2-e,n) < 21og2 n + C ^"^^^ 

• For any2<a< j^^, i?2(a,n) < Cn^-'/''. 

The counting argument presented below is based on the existence of dense 
graphs for which all metrics defined on subgraphs are very far from each other. 
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Let A be a set of vertices in the graph G = {V, E). We denote by Ea the 
set of edges in G with both vertices in A, and the cardinahty of Ea by ca- 



^A 



The density of A is ttxtT' 

We first explain the relevance of large girth and high density to our prob- 
lem. Let G = (y, E) be a large graph of girth g with no large sparse sub- 
graphs. With every H O E we associate a metric on V defined by ph{u,v) = 
min {g — 1, dn^u, v)} , where dn is the shortest path metric in the subgraph of 
G with edge set H. Below we show that among these pn are metrics that can- 
not be embedded with small distortion in any low-dimensional normed space. 

Lemma 4.3. // there exists a graph G = {V,E) of size n with girth at 
least g, in which every set of>s vertices has density at least q, then for every 
h- dimensional normed space X and every real 1 < a < g — 1, 



1 + 2 
Rxia,n) < max 



/ Uag \ n 

^log2 7 + log 

\g-a-lj 

Proof. To prove the theorem, we may certainly assume that Rx{a, n) > s. 
Let k = Rx{(x, n); namely, every n point metric space contains a subset of size 
k that a-embeds in X. In particular, for every H C E there is a set of k vertices 
Ah ^ V such that (Ah, Ph) a-embeds into X. Therefore there is a certain set 
A oi k vertices, that is suitable for many sets H C E. That is, there is a class 
n of at least 2l^l/(^) subsets i? C £' for which A = Ah, and therefore {A, pn) 
a-embeds into X. Consider Hi,H2 G 7i equivalent if Hi n Ea = H2 H Ea- 
There are at most 2'^''^-^ members in Tt that are equivalent to a given set H. 
Consequently, there are at least 2^'*/(^) subsets H C E which are mutually 
inequivalent and for which (A, pn) a-embeds into X. Let fu'-A^Xhe such 
an embedding, i.e., for every u,v ^ A: 

-Ph{u,v) < Wfuiu) - fH{v)\\x < Ph{u,v). 
a 

Since pn takes values in {0, 1, . . . , (7— 1}, by applying an appropriate translation 

we may assume that fni^) ^ BxiO,g). We now "round" the images fniA) 

to the points of a (5- net in BxiO,g), where 5 will be determined soon. Let M 

be a (5- net of Bx{0, g), and define (pniv) to be the closest point in M to fniv)- 

We claim that if Hi,H2 ^ E are inequivalent, i.e.. Hi n Ea / -ff 2 H Ea, then 

(pHi 7^ 4'H2- Indeed, we may assume that there are u,v G A such that {u,v) £ 

Hi \ H2. Since the girth of G is at least g, this implies that phA'^, v) = g — 1, 

whereas pHi{u,v) = 1. Now, if (pHiiu) = (^//^l^) ^^d </>//j(f) = (pnA'^) then: 

g-l _ PH2 {u, v) 
a a 

< WfnM - fnMWx < 26+UhA^) - <PhM\\x 

= 26 + UhAu) - <PhM\\x < AS+UhAu) - fHAv)\\x 
<4:6 + phAu,v) =4(5 + 1. 
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We select 6 = ^^^^ so that this becomes a contradiction. It follows that each 
of the aforementioned 2^-* / (^) inequivalent sets H ^ Tt gives rise to a distinct 
function (pn '■ ^ ^ ■^■ 

By standard volume estimates, \M\ < {-§-)^- Hence there are at most 
\J^\ < {~§')^^ distinct functions from A to J\f . Consequently, 

By estimating (^) < (^)'' < (^)^ we have that 

Mog, ^-J > ^^ - log, (- 
which yields the claimed bound on A;. D 

Such graphs do exist as we now show: 

Lemma 4.4. For every integer g > 4, there exist graphs G = iV^E) of 
arbitrarily large order n and girth at least g in which every set A C V of 
cardinality at least n ^g has density at least n ^g . 

Proof. This is a standard construction from random graph theory. Let 
A^ > C^ be an arbitrarily large integer, where C is an appropriately chosen 
constant. Let ?? = jr. Pick a random graph in G{N,p) where p = 2 ■ N~^~^'^^. 
We claim: (i) With probability > 2 this graph has fewer than ^ cycles of 
length < g, and (ii) With almost certainty, every set of cardinality > (y) ~ 2 
has density > (y)"^"''^''- The theorem now follows by taking a graph with 
these two properties and removing ^ vertices, including at least one vertex 
from each cycle of length < g. The resulting graph has ^ vertices, it has no 
short cycles, and satisfies the density condition. 

The expected number of cycles of length < g is 
9-1 



—p'N{N-l) . . . (N-i+l) < I Y^^pNY < {pNy-^ = (2 ■ N^y ^ 



N 

i=3 i=3 

In the last inequalities we use the facts that pN > 2, N > C^, and C > 4. It 
follows that with probability > ^j there are no more than N/2 cycles shorter 
than g. 

The expected density in every set of vertices is, of course, p. To estimate 
the deviation, we use the Chernoff bound: 

Thus, the probability that there exists a set of cardinality > k and density 
<p/2 does not exceed 2^ exp(-(^)p/8). For k = (f )^-2 , p = 2- A^-^+^r?^ ^^^ 
the assumption N > C^, this is easily seen to be o(l). The claim follows. D 
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Theorem 4.1 now follows easily: 

Proof of Theorem 4.1. The claim for a > 2 is obtained combining 
Lemma 4.3 and Lemma 4.4 with g = \a + 2\. As 1/q < s < n ~s^ , we obtain 
that for an appropriate choice of constant C, 

Rx{c(,n) < Cn^^'^ {hloga + log{n'^)). 

We choose c = 1/16, so that n^-'^/^ = n 'Te^ > n ^5^ log(n~). The claim for 
a > 2 now follows. 

For the case a < 2 we use, instead of Lemma 4.4, the (trivial) analogous 
statement for the complete graph K^. That is, we apply Lemma 4.3 with 
5 = 3, s = 2, and q = 1. D 

We are now ready to prove the promised upper bounds on R2{a,n). 

Proof of Corollary 4.2. The result follows from the Johnson-Lindenstrauss 
dimension reduction lemma [31] for £2- Let a > 1, and let k = R2{a,n); i.e., 
every n-point metric space M contains a fc-point subspace that a-embeds into 
£2- By [31], for any < 6 < I, this subspace a(l + (5)-embeds into £2 '^i*^ 
h < — ^p— . Hence, R2{a,n) < R£h(a{l + 6),n). Now apply Theorem 4.1. The 
claim for a > 2 follows by taking 6=1. For a = 2 — e we set 5 = e/4. Then 

/ e \ Clog/clogf-) 
k = R2{2 - e,n) < Rfu [2- -,nj < 21og2n + 2 ^^ 

which implies the bound in the proposition. D 

Another interesting consequence of Theorem 4.1 are upper bounds for 
metrics defined by planar graphs. This may be interesting in view of the fact 
that the target metrics in our lower bounds in Section 3 are ultrametrics (and 
thus planar). 

Theorem 4.5. Let T he a family of graphs, none of which contains a 
fixed minor H on r vertices. Then for every integer n and every a > 1: 

Rjr{a,n) < Cr^n^'^^^log^ nloga, 

where c, C > are universal constants. 

Proof. It is implicit in [50] that the Euclidean embedding that Rao con- 
structed is also a good low dimensional embedding into ioo- More precisely, if 
F ^ J^ is a graph on n points then it embeds with distortion C into £^, with 
h < Cr^ log n. Thus Rjr{^a,n) < R^h [Ca,n). The result now follows from 
Theorem 4.1. D 
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5. Expanders and Poincare inequalities 

In this section we prove lower bounds for the metric Ramsey function 
in the case of expanders. The proof is based on generalizations of Poincare 
inequalities used by Matousek to prove lower bounds on the Euclidean distor- 
tion of expander graphs. To obtain these inequalities we pass to a power of 
the graph and delete vertices with small degree. The argument shows that 
large subsets of expanders contain large sub-subsets which satisfy an appro- 
priate Poincare inequality (see Lemma 5.4 below). First, we recall some basic 
concepts on graphs. 

Let G = (y, E) be a d-regular graph, and let A be its adjacency matrix, 
i.e. Auv = 1 if [n, v\ ^ E and A^v = otherwise. Let Ai > A2 > • • • > A„ be 
the eigenvalues of A. It is easy to observe that Ai = d. Also, trace(yl) = 0, so 
that Xn < 0. We occasionally write Xi{G) to specify that G is the graph under 
consideration. We define G's multiplicative spectral gap as: 

X2{G) 



7(G) 
he absolut 

7+(G') 



d ■ 
We also define the absolute multiplicative spectral gap of G as: 

maxj>2|Ai(G)| max{A2(G), -An(G)} 



d d 

In what follows we will use the following standard estimate: 7+(G') > 1/d. 
To verify it observe that nd = trace(A2) = ^^^^ Xi{Gf < d'^ + {n-l)[dj+{G)]'^ 
and use the fact that 1 < d < n — 1. We remark that this elementary bound 
is weaker than the Alon-Boppana bound [1], but it is sufficient for our pur- 
poses, and holds for all d (while the Alon-Boppana bound only holds for small 
enough d). 

The main statement of this section is: 

Theorem 5.1. Let G = {V,E) be a d-regular graph, d > 3. Let 7 = 
7-i-(G). Then for every p,a > 1: 

Rp{G;a) <Gd\V\^'^^^^^, 
where C, c are absolute constants. 

Given S,T O V, we denote by E{S, T) the set of directed edges between 
vertices in S and T; i.e, 

E{S, T) = {(n, f ) G 5 X T; [n, v] G E}. 

We also denote by E{S) the set of edges in the subgraph induced by G on S; 
i.e., 

E{S) = {{n, v}; u,v £ S, [u, v] £ E} . 

With this notation, \E{S)\ = M|^. 
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The "Expander Mixing Lemma" [1] states: 

Lemma 5.2 (Expander mixing lemma) . Let G = (y,E) be a d-regular 
graph {which may have loops and/ or parallel edges). Then for every S,T <^V, 



\E{S,T)\ 



d\S\\T\ 



V 



<7+(G)dv/|5im 



In particular, 



2\EiS)\ \S\ 



\S\ 



\v\ 



< l+{G)d. 



Lemma 5.3. Let G = [V, E) he a d-regular graph, d > 3. Let 7 = 7-|-(G). 
Then for any B C V satisfying \B\ > 8j\V\, there exists G C B such that 
\G\ > \B\/3, and for any u £ G, 

r\B\ , , , A\B\ 
d^zTTTT < deg,^(ti) < d- 



\V\ 



\V\ 



Proof. Denote k 



\B\. By the expander mixing lemma, 

dk^ 



E B) 



dk^ / 1 
- 2n V 8 



< 



n 



Set B' = {v £ B; deg5(f) < (4(i/c)/n}. Since the graph induced by G on i? 
contains k — \B'\ vertices of degree greater than {4:dk)/n, it follows that: 

de 
n 



> \E(B)\ > 



k-\B'\ 



Adk 
n 



so that \B'\ > k/2. Again, by the expander mixing lemma, 

2\E{B')\ 2\E{B')\ dk 



k 



> 



2\B' 



> 



An 



We now apply an iterative procedure which produces a sequence B' = 
Bq D Bi D B2 D . . . as follows: ifm.myi=B, deg^^(w) < |^ then -Bj+i is obtained 
from Bi by throwing away a vertex u £ Bi with deg£.{u) = min^g^i deg^.(t;). 
Otherwise i?i+i = Bi. This procedure eventually ends, and we are left with 
a subset G ^ B' . Since at each step we delete at most ^ edges from B' , we 
have that: 



\E{G)\ > \E{B') 



de dk^ 
8n ~ An 



d^ 
8n 



8n 
d^ 

8n ' 



Note that the graph induced on C by G has minimal degree at least g^. 
To estimate \G\ we apply the expander mixing lemma to get that: 



2\EiG)\ 
\G\ 



<d 



\G\ 



n 



+ 7 
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Thus 

2n 2 - ' ^ ^' - 8n 

Since k > 8jn, ^ < f^. Hence: 

2n ~ 16n' 
so that \C\ > |. D 

The following is the Poincare inequality used in the proof of Theorem 5.1. 

Lemma 5.4. Let G = {V,E) he a d-regular graph, d> 3. Let 7 = 7+(G). 
Then for any B C V satisfying \B\ > 8^\V\, there exists C C B such that 
\C\>\B\/3 and the following holds true: For any p>l and for every f '.C ^ip: 

E ||/(n) - f{vX < ^^^^ Yl ll/H - /(^)llp- 

u,vGC [u,v]£E{C') 

The proof of Lemma 5.4 proceeds by first proving a slightly stronger ver- 
sion of it for p = 2 and then extrapolating to the general case via the following 
lemma based on an extrapolation argument which was used in [41]. Its proof 
is delayed to the end of the section. 

Lemma 5.5 (Extrapolation lemma for Poincare inequalities). Let G = 
{V,E) be a graph with maximal degree at most A. Fix p > 1 and let A > be 
a constant such that for every / : 1/ ^ M, 

(5) E |/(n) - f{vr < {Apf^^ E l/(^) - /Wl'- 

u,v£V lu,v]eE 

Then for every < q < p and for every f : V ^ iq, 

u,v£V [u,v]£E 

Additionally, for every p < q < 00 and every f : V ^ ig-. 

E \\fiu)-f{v)\\l<{AAqy^-^ E ll/(^)-/WII^- 

u,v£V \u,v]£E 



Proof of Lemma 5.4. Denote n = \V\ and k = \B\. By Lemma 5.3, there 
exists C C B with \G\ > k/3, such that the induced subgraph of G on C, has 
minimal degree at least kd/8n and maximal degree at most A = Akd/n. 
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(6) 



We first prove that the fohowing inequahty hold true for every f '■ V ^ £2'- 

32n 
T 



lu,v]GE{C) 

32- 4k 



u,v<=C 



A 



E ii/(^)-/wii2- 



[u,v]eE{C) 

By summatfon we may clearly assume that / : C ^ M. By translation 
we may assume that J2vec fi''^) ~ ^- Extend / to F by letting f{u) = for 
u ^ C. Now, 

E [/(^) - /w]' = 2(ic^i + 1) E /(^)' - 2 E /(^)/(^) 



u,vGC 



■uec u,vgc 

^2 



= 2(|ci + i)E/w'- E/(^) 

= 2(|C| + l)E/W'<4A:E/(^)'- 
Since J2veV /("") = we can use the spectral gap of H to get that: 
E [/(^)-/W]' = 2Edeg^(C)W/W'-2 E /(^)/(^ 



[«,i;]G£(C) 



dGC [m,d]G-E;(C) 

^2 E £/(-)' -2 E /(-)/(-) 



4n 



t;ey 



^£E/w^-7^E/(^ 



4n 



j>ev 



?)GV 






which implies inequality (6). The Poincare inequalities for p > 1 now follow 
immediately from inequality (6) via Lemma 5.5, and by substituting the value 
of A. D 

Proof of Theorem 5.1. The proof proceeds by showing that for every 
B C V satisfying Cp{B) < a, 

\B\ < 100d\V\^~ 2=eip° . 
Set k = \B\ and n = \V\. Define: 

log(8n) 



2560pa log d + log ( ^ 



ON METRIC RAMSEY- TYPE PHENOMENA 693 

Note that t < diam(G) since it is well known that diam(G) > log^(n). We 
may also assume that t > 1, since otherwise, using the fact that I/7 < d, we 
get that n < gSseipaiogd^ j^^ which case the required result holds vacuously. 

Denote by A the adjacency matrix of G. Let H be the multi- graph with 
adjacency matrix A*. In other words, the number of i?-edges between two 
vertices in V is the number of distinct paths of length t joining them (and 
it is if no such path exists). The multi-graph H is d* regular and by the 
spectral theorem, 7_|_(i7) = 7*. We may assume that k > 87*72 (otherwise the 
conclusion of the theorem in trivial) . 

It follows from Lemma 5.4 that there exists C d B such that \C\ > fc/S, 
and for every / : C ^ ip-. 

u,veC [u,v]eEH{C) 

Let f : B ^ ip he an embedding such that for all u,v G B, '^^"'^'^ < 
\\f{u)-fiv)\\p<dGiu,v). Then: 

E \\fi^)-fivWp< E dG{u,vr<\EH{BW<^^, 

[u,v]£Eh{C) [u,v]£Eh{B) 

where the last inequality follows from an application of the expander mixing 

lemma: 

, , ,, d'e f l\ dV 
\Eh{B)\ <-— 1 + - < 



2n \ 8/ n 

Let s = |_log^ (12)] • ^^ rnay clearly assume that s > 1. The number of 
vertices of distance at most s from a given vertex vq G G is bounded by: 

l + d + --- + d' <2d' <-<^-^. 
- 6 - 2 

Hence: 

(7) E ll/(-)-/WII^>^ E ^o{u,vf>''^ 



k'^sP P 



ISaP ~ 80aP 

Plugging this into the Poincare inequality we get that: 
k 



log, ( ^ 



80aP 
which gives 



log, ( ^ 



{32p)Pn d^khP 



d* 



n 



logrf (^) - 2560pat ^ A: < Ud^^^^P'^K 
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Since t < 2ILiL! — ^_^ [i follows that: 

2560pa log d+log ( ^ j 

log(8n) • 2560palogd 



k<12 exp 



2560pa log d + log ( ^ 



1 log(l/T) 1 l°Sd(l/^) 

< lOOn 2560pc,logd + log(l/-,) < lOOn aseipc ^ 

where we have used once more the estimate log(l/7) < logd. D 

It remains to prove Lemma 5.5. 

Proof of Lemma 5.5. The case < q < p is simple. Coordinate-wise 
summation of (5) shows that for every / : V ^ Ip: 

u,vGV [u,v]eE 

Since £2 is isometric to a subspace of Lp, it follows that for every f '■ V ^ £2, 

u,vGV [u,v]gE 

Since (M, |x — t/l ) is isometric to a subset of £2 ([52], [22]), the required 
inequality follows. 

We now pass to the case p < q. In this case the following standard 
numerical inequality holds true for every a, 6 G M (see Lemma 4 in [41]): 

(8) ||a|'?/*'sign(a) - |6|«/Psign(5)| < % - b\ (laf^-^ + |6|p"^ 

I ' p \ 

It is suffices to prove the claims coordinate- wise, i.e. for functions f -V - 
Fix some f:V^M. By continuity there is some c G M such that: 

^|/(^)+c|'?/Psign(/(t;)+c)=0. 

vev 

Hence, by replacing / with / + c we may assume that: 

El/(«)l'^/^sign(/(t'))=0. 

veV 



ON METRIC RAMSEY- TYPE PHENOMENA 



695 



Now: 



v<^V 



dGV 



E l/WI' = E l/Wr/^sign(/(t')) - 4 E l/Mr/^sign(/(^)) 
^ E \\f{n)fPsign{f{u))-\f{v)fP^ignU{v))\'' 



\V\ 



u,v£V 



P 



< 



< 



jAp) 

A 

{Aq)P 



E \\f{uW/Psign{fiu)) - \f{v)\'^/Psign{fiv))\' 



\u,v]eE 



E i/H-/wr(i/wi'"' + i/wi'"' 



[u,v]&E 

where in the last two steps we have used (5) and (8), respectively. An appli- 
cation of Holder's inequality gives that: 

E \fiu) - f{vr ■ {\fiu)\i-' + \f{v)\'^-'^'' 

[u,v]£E 

(\p/g/ \ 1-- 

E I/W-/WIM E (i/wi'"' + i/wi'"' 
[u,v]<^E / \[u,v]eE 

Using the assumption on the maximal degree we get that: 



— 1 \ q-p 



E (1/(^)1'"' + i/( 

u,v]£E 



,,--1 \ 1- 
V]\ p 



^<2^-i E (i/wr + i/wr) 



M'^E 



<2^AEi/wr- 



vev 



Summarizing, we have shown that: 
{2Aq)P 



vl<i 



\-i 



E i/wi"^ ^ ^^ E i/w - /wi^ E i/w 



This inequality simplifies to: 



\v(^V 



E 1/(^)1'^^^ E i/w-/wi^ 



tiev 



fu,flg_E 



We conclude by noting that: 



E i/(^)-/wr<2'^ii^iEi/wi'- 



D 
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We now show that the interplay between the Euchdean distortion and the 
cardinahty in Theorem 5.1, for p = 2, is tight, up to the dependence on d 
and 7. We require an upper estimate for the diameter of an n-point expander. 
It is well known that the diameter is O(logn), but here we will be a little bit 
more accurate. 

We need the following bound on the diameter of expander graphs [20] : 

Proposition 5.6. Let G = {V,E) be an n-vertex, d-regular graph. De- 
note 7 = 7+(G). Then the diameter of G is at most logii^ n + 1. 

Proposition 5.7. Let G = {V, E) he an n-vertex, d-regular graph, d > 3, 
and set 7 = 7+(G). Then, there is an absolute constant C > such that for 
any a > 1, 

R2{G;a) > REQiG;a) > n'-^^^aW). 

Proof. Iteratively, extract a point x £ V together with a ball of radius 
r = diam(G)/a around x. Each such ball contains at most d + d{d — 1) + • • • + 
d{d — 1)I-'"J < 3{d — ly^^ points, and thus we can repeat this process at least 
n/{3{d — l)^"*"^) times, and get the desired set. Its size is at least 



3(d 



n n 



n , , , '""i/-^ " 1 1- 



-3(d-l)2^ ' 3(d-l)2 n 



6. Markov type, girth and hypercubes 

Markov type was defined in [2] and was applied in [38] to obtain lower 
bounds for the Euclidean distortion of regular graphs with large girth. This 
concept plays a key role in our analysis of the metric Ramsey problem for 
the discrete cube and graphs with large girth. Let {X, d) be a metric space. 
We shall say that {M^j^g is a stationary time-reversible Markov chain on X 
if there are xi, . . . ,x„ G X, an n x n stochastic matrix A and a stationary 
distribution vr = (tti, . . . ,7r„) of A such that for every i,j, iTiAij = iTjAji, 
{Mfcj^Q is a Markov chain with transition matrix A and Mq is distributed 
according to tt. {X, d) is said to have Markov type p > with constant C if 
for any stationary time-reversible Markov chain on X, and for any time s: 

E[d{Zs,Zor]<CPsE[d{Zi,Zor]. 

In [2] (see also [38]) it was shown that Hilbert space has Markov type 2 
with constant 1. Actually, these references deal with the special case in which 
A is symmetric and vr is the uniform distribution on the states xi,...,Xn, 
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but the proof is easily seen to carry over to stationary time-reversible Markov 
chains. 

6.1. Graphs with large girth. For later applications, it will be convenient 
to introduce a notion of "Euclidean distortion at small distances" as follows. 
Let (X, dx), iXi dy) be metric spaces and s > 0. For every injective f : X ^ Y 
define: 



d\s&\f) 



sup 



dYifix)Jiy)) 



sup 



dx{x,y) 



0<dx{x,y)<s dxix,y) J \o<dx{x,y)<sdYif{x)J{y)) J ' 



and: 



4\x)=mf[dis&\f); f:X^Y] 



»/ 



As before, we write c"2'{X) = cf (X). 

Let G = (y, E) be a graph. In what follows we denote by 6{G) the average 
degree of G, i.e. 

^ '~ \v\ "W 

We begin with the following strengthening of a result from [38]. 

Theorem 6.1. Let G = (V, E) he a graph with girth g and average degree 
5 = 6{G); then for every integer 1 < s < g/2, C2 (G) > -^^/s. In particular., 

5-2 



C2{G) > 



g_ 

2J 



1. 



Proof. Assume first that G is connected. Consider the reversible Markov 
chain {-^fejfcLo that corresponds to the canonical random walk on G. Recall 
that vr^, = deg{v) / {5n) is a stationary distribution of this Markov chain. 

For every 1 < s < g/2, 

deg(w) — 1 



E[dG(Zs,^o)]>IE.eV 



+ 



deg{v) 
1 



E, 



'v&V 



deg(t; 
deg(t;) — 2 
deg{v) 



{¥.[dG{Zs-i,ZQ)\Zs-i = v\ + l) 

^[dG{Zs-uZQ)\Zs^l=v]-^ 

+ ndG{Zs~i,Zo)\Zs^i = v] 



¥.[dG{Z,_uZo)] + l-Y,^v 



deg{v) 



^-^ on deg(f) 



¥.[dG{Zs-i,Zo)\ + 



6-2 
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By induction K[dG{Zs, Zq)] > s— j— . Therefore 

^2l \ FTP^^Y'z v_m2^„2/'^-2 



E[dG{Zs, ZoY] > [mdZs, Zo)Y >s\ ^ 


On the other hand, since Hilbert space has Markov type 2 with constant 1, 

E[dG{Zs,Zof] < ci'\GfsE[dG{Zi,Zof] = c^i\G)h. 

If G is disconnected, there is a connected component C of G in which 
the average degree is at least 6 = 6{G). The theorem follows by applying the 
above proof to the connected graph spanned by G. D 

Let G = (y, E) be a d-regular graph, and let A be its adjacency matrix, 
and Ai > A2 > • • • > A„ be the eigenvalues of A. Recall (see Section 5) 
that the multiplicative spectral gap of G is 7(G) = '^ \ and the absolute 
multiplicative spectral gap of G is 7+(G) = "^ j'~ " . For S* C F, 

let E{S) = {{n, f}; u,v £ S, [u,v] G E} . Recall that the Expander Mixing 
Lemma implies that 



ms)\^^ 



^-74G) 



\S\ 

This statement motivates the following useful definition: 

Definition 6.2 (self mixing parameter) . Let G = {V,E) be a d-regular 
graph. The self-mixing parameter of G is defined as: 

The Expander Mixing Lemma implies that /i(G) < 7_(.(G). We have in 
fact the following estimate: 

Lemma 6.3. Let G = {V,E) be a d-regular n-vertex graph, let A be G's 
adjacency matrix and let d = Xi > ■ ■ ■ > Xn be its eigenvalues. Then: 

MO) < ^. 

Proof. Let wi, . . . ,Wn be an orthonormal system of eigenvectors for A 
with Awi = XiWi for i = 1, . . . ,n. Let 1 = ly be the all-ones vector; then 
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wi = r. — 1- Let Is be the indicator of some subset S" C y. Then, 

2\E{S)\ = {Als,ls) 

n n 

^^'^{is,Wi)'Wi, ^(15, Wi)'Wi 

i=l i=l 

n 



■-^{ls,w,i)'^Xi 



i=l 

n 

2 



\s\ 



i=l 
d + Xn\S\. □ 



n 
Lemma 6.4. Let G = iy,E) he a d-regular graph with girth g and put 

is) 

fj, = /u(G). Fix B CV, 1 < s < g/2 and denote a = C2 {B^dc)- Assume that 
a^ < s. Then, 

Proof. Set \B\ = k and \V\ = n. By the definition of the self mixing 
parameter, 

2\E{B)\ >^^-^idk. 
n 

Consider the graph on B induced by G (i.e. the edges are the edges of G which 
are also vn B x B). Its girth is not less than g and its average degree is: 

k n 

Moreover, since G has girth g and s < g/2, lidsiu, v) < s, for some two vertices 
u,v ^ B, then dB{u,v) = dc{u,v). Consequently, C2 {B^ds) < C2 {B^dc) = 
a. An application of Theorem 6.1 yields: 

a>(lj)vi, 

so that, 

dk , ^ ^ ^ 2 
— - fid<6 < —, 



which gives: 



n 1 — ^ 



2n 
k < fm -\ y ^. D 
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Let G = (y, E) be a graph and 1 < t < diam(G) be an integer. We define 
the t-distance graph of G as G^^' = {y,E^*') where [u,v\ G E^*' if and only if 
dG{u,v) = t. We cohect below some properties of G^^ (part 6 of the lemma 
below will not be applied in the sequel, and is included here for possible future 
reference) . 

Lemma 6.5. Let G = (y, E) he a d-regular graph, d > 3, with girth g and 
letl <t < g/2. Then: 

1) G^*-* is a d{d — 1)*~^ regular graph. 

2) The girth of G^^' is not less than g/t. 

3) For every n, w G y, 

, ( ^ ^ 5 ^ , / X dG{u,v) 

dew [u, V) < — =^ dew [u, V) — 



2t "^^'v"-^ t 

4) For every B <ZV and 1 < s < ^^, c'^\B,dGw) < C2{B,dG). 

5) Ift is even then fi{G^^^) < 8{d - l)"*/"^. 

6) Ift is odd then 7(GW) < Se'^^'^^^'^Ws , 

Proof. Since G has girth g and t < g/2, the number of vertices with 
distance t from a given vertex is the number of leaves of a d-regular tree of 
depth t, which is d{d — 1)*^-*^. This proves 1). The statements 2) and 3) are 
also simple consequences of the fact that G has girth g. Assertion 4) follows 
immediately from assertion 3). 

To prove assertion 5), note that the adjacency matrix of G^*', A*-*), is the 
i-distance matrix of G; i.e., Aui = 1 if dciu^v) = 1 and otherwise. If we 
denote by A the adjacency matrix of G then there exists a polynomial Pt of 
degree t such that A^^' = Pt{A) (this is the so-called Geronimus polynomial. 
The properties of the polynomials used here can be found e.g. in [13], [38]). 
The polynomial Pt has degree t; all its roots are real and reside in the interval 
[—2^/W^^,2^Jl^^^]. An explicit trigonometric expression for Pt is: 



Pt{2Vd- 1 cos 61) = (d- 1) 



,/2-i {d - 1) sin((t + 1)6) - sin((t - 1)9) 



smi 



Finally, if t is even, then Pt is an even function and if t is odd, then Pt is an 
odd function. The spectral theorem shows that {Pt{Xi{G))} are the eigenvalues 
of^W. 

We turn to estimate the smallest eigenvalue of A^^' , for t even. This eigen- 
value must be negative, but Pt is positive outside the interval [— 2\/d — 1, 2\/d — 1]. 
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In other words, ii Pt{x) < 0, then x G [—2\/d — 1, 2\^d — 1] and x = 1\/ d — 1 cos Q 
for some Q G [— 7r,7r]. Therefore, 

sin0 

Using the elementary estimate | sinra;| < r\ sina| for a G [— 7r,7r] and r > 1, it 
follows that: 

\Pt{x)\ <(i(t+l)(d-l)*/2-i. 

Hence, by Lemma 6.3, 

MG(*)) < '^'^,('jL'r)/-r" = (^ + m - ir" < m - ir". 

To prove assertion 6) we distinguish between two cases: 

Case one. \2{G^^') = Pt{\2{G)). In this case we apply the mean value 
theorem and find some a G (A2(G'), Ai(G)) such that: 



log 



7(G(*)) 



log 



Ptid) 



PtiHG)) 



[d-A2(G)]-^*^"^ 



Pt{a) 



[^-liG)]dJ2-l 



i=l 



a-Vi 



where yi are the roots of Pt- Therefore, 
1 



log 



7(G(*)) 



>[l-7(G)]t- 



d 



> 



[1 - 7(G)]t 



d + 2y/d^^ 2 

as claimed. 

Case two. X2{G^^') = Pt{Xi{G)) for some i > 3. We claim that Aj must 
be in the interval [—2\/d — 1, 2\/d — 1]. Recall that all the zeros of Pt are in 
this interval. It is impossible that Aj < —2\Jd — 1, since Pt < there (t is 
odd). Also, \i > 2\/ d — 1 is impossible, since Pt is increasing on \2\J d — 1, oo) 
and A2(G) > Ai(G), whereas Pt(Ai(G)) > Pt(A2(G)). Therefore, Ai(G) G 
\—2\/d — 1, 2\/ d — 1], and as in the proof of 5), we estimate: 



P*(A.(G))<d(i + l)((i-l)*/2-i. 



Hence: 



7(gW) = ^^^ < (t + i)(rf - 1)-*/^ < (t + 1) • 2-*/^ 

which implies the required result. 



D 



702 YAIR BARTAL, NATHAN LINIAL, MANOR MENDEL, AND ASSAF NAOR 

We can now prove an upper bound for the Ramsey problem for graphs 
with large girth. 



Theorem 6.6. Let G = {V,E) he a d-regular graphs d > 3 with girth g. 
is an absolute constant c > ■" 

R2{G;a) < 12{d - 1)-^^\V\. 



Let 1 < a < ^ . There is an absolute constant c > such that 



Proof. The proof proceeds by showing that for every B C. V such that 
C2{B,dG) < a, the following estimate holds: 

\B\ < 12(d-l)~^|y|. 

Let t be the unique even defined by -^ — 2<t< ■^. Put s = 4a^. Now, 
since s < ^, part 4) of Lemma 6.5 implies that: 

ci'\B,dGin)<C2{B,dG)<a. 

By Lemma 6.5, girth(G^*'') > g/t. Also, s < ^, so we can apply Lemma 6.4 to 
G*-*-*. Combined with assertion 5) of Lemma 6.5 we deduce: 

|5|<Ai(G(*))|y|+ ^'^' 



< 



d{d-iy-Ui 



8(d-l)-*/^ + 4((i-l)- 



/4a2 



\V\ 



<12(d-l)-*/*|F| 



t/4| 



iC-a- 



< 12{d - l)-i{^-^)\V\ < 12(d - 1)-^^\V\. D 

6.2. The discrete cube. The solution for the metric Ramsey problem for 
the discrete cube is also based on the notion of Markov type. The discrete cube 
has a small girth, and so other ideas are called for. Our analysis utilizes another 
family of orthogonal polynomials — the Krawtchouk polynomials which appear 
in many studies related to the discrete cube. 

Let k < d. The degree-A; Krawtchouk polynomial for the d-dimensional 
cube is: 

x\ I d — X 

... .k-J. 

Again we need an estimate for the smallest value that this polynomial 
takes. 

Lemma 6.7. Let I <k < ^ be even. Then: 



<^(-) = E(-i)^(-) 



^^^(^)^-(-j v^. 
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Proof. It is known (see for example [35]) that all k zeros of Kj^ are real 
and belong to the interval: 

^-y/{k-l){d-k + 2),^ + V{k-l){d-k + 2) 

Since k is even, Kj^ is symmetric around d/2 (i.e. Kj^ (x) = K^ 

■{d — x)). It is also easily checked that the leading coefficient of Kj^ is ^~^| . 
So Kj^ (x) < only for x in the above interval. Let zi,Z2, ■ ■ ■ ,Zk be the zeros 



of Af>(i) 



<w=|n( 



k j^ 

'Zi - X] 



and since x, zi, . . . , z^ are all in an interval of length 2y^(/c — l)((i — A; + 2) < 
^\Jk{d — k), it follows that for x in the interval above: 

To verify this inequality, note that after clearing equal terms, it reduces to 
[d{d—k)]''''^ < d{d— 1) . . . {d—k+l). This follows by multiplying the inequality 
d{d -k) <{d-j){d + j -k + 1) over j = 0, . . . , k/2. D 

Let Qd = {0, l}'^ be the graph of the d-dimensional cube. (Two vectors 
are adjacent if and only if they differ in exactly one coordinate.) As before, we 
consider the t-distance graph on the cube ^^ • It is well known (e.g. [21]) and 
easy to show ^ that the eigenvalues of the graph O^ are the numbers Kj: (i) 
for i = 0, . . . , d where the i-th eigenvalue appears with multiplicity (J. This 
graph is (j)-regular and so its largest eigenvalue is (J. Lemmas 6.3 and 6.7 
now yield an estimate for the self-mixing parameter fi{^y). 

Lemma 6.8. For every even integer 1 < t < d/2, 



t/2 



f).(f) 

To prove the main result of this section, we need an additional estimate. 



^To show this, recall the 2"^ Walsh functions {Ws \ S £ ^d} that are defined via Ws(T) = 
(— 1)' '"^' . It is not hard to see that they form a complete set of eigenfunctions for Q^ and 



the eigenvalue corresponding to Ws is kI '{n — 2\S\). 
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Lemma 6.9. Let t,x,d be integers such that 2t < x < d/2. Then: 



^ (x\ (d-x\ ( 150x \ */•" (d\ 



i=t/3 

Proof. Clearly, (^) = J2j (^)(t-^)' s° i^ suffices to consider the range 

X < j^. In other words, we assume |<i<t<f< ggg- ^^ this range, the 

terms decrease geometrically, {jli) {f_(~^ii) < Joi^) (t-y)- ^^ therefore suffices 

to show that {t%){tt/l) ^ (i^)*/^ [f) . Recall the following elementary and 
well-known estimates of binomial coefficients: For every 1 < A: < n, 

n\i' /n\ i'en\k 
k) - \k) - \~k) ■ 

We plug this into the expression and simplify, to conclude that the inequality 
holds. n 

Theorem 6.10. There are absolute constants C,c,d > such that for 
every integer d and for every a > 1, 



2{^-'''^^)<i < R^(nd;a) < C2(^'^> 



Proof. We start with the lower bound. An easy fact from coding theory, 
called the Gilbert- Varshamov bound, [40], states that there exists a subset 
B Q^d such that all n 7^ w G i?, are at distance > ^, and: 



a^ 



LB > J- > 2I ^^^J"*, 

V ( ) ~ 

where the last inequality follows from standard estimates on binomial coeffi- 
cients. Note that for every u,v £ il^, \\u — v\\2 = ^/p{u,v), where p stands for 
the Hamming distance. But for every distinct u,v £ B, ^ < p{u, v) < d, so 

that ^ < 0^ < Vd. Consequently, C2{B) < a. 

To motivate the proof of the upper bound, let us sketch a proof based on 
Markov type for (a weakening of) the classical fact [25] that C2{^d) ^ ayd for 
some absolute a > (in fact, C2{^d) = Vd)- The random walk on il^ almost 
surely drifts with constant speed from its point of origin for time > a'd for some 
absolute a' > 0. This is true because a constant fraction of the coordinates stay 
unchanged for this duration. On the other hand, the fact that Hilbert space 
has Markov-type 2 implies that the corresponding walk on an image of Qd will 
typically drift only 0(vw) away from its origin. This discrepancy implies a 
metric distortion > cya, as claimed. The spirit of the proof we present is 
similar. We only have a subset i? C Q^, so we consider (a dense connected 
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component of) the graph f^J^ • The main technical effort is in estimating the 
typical rate of drift from the walk's origin. 

We wish to show, then, that ii B (1 0,^ satisfies C2{B) < a, then \B\ < 



C2i'- 



Let n = 2 and k = \B\. We seek an upper bound on k. As in the 



proof of Theorem 6.6, we investigate the random walk on the distance t graph 
of the graph in question, namely O^ . We define t as the even integer nearest 
to -j^^, where K is a suitably large absolute constant to be specified later. It 

can be verified that 2v ~'^) > 2n (^) for this choice of t. Therefore, we 
may assume that: 



k 

- > 2 

n 



64t\ 



t/2 



or else the required upper bound on k already holds. 

Denote by Ef{B) the number of unordered pairs of points of distance t 



in B. In terms of the graph $7^ this is: Et{B) 



l^n'" 



B)\. By Lemma 6.3: 



2Et{B) > 



)e 



n 



('("!,"; 



tj - n 



^)"^(>. 



so that: 



5^.. (B) 



2Et{B) 
k 



> 



64t 



t/2 



> 



k_ 
2n' 



There is a connected component C of the subgraph of Q^ spanned by B that 
has average degree 6' > 5, i.e.. 



6' = 6^i.,{C)>6> (f ) — 



k 
2n' 



Let {Zr}'^Q be the random walk on 17^ restricted to C. We start the walk at 
the stationary distribution, viz.. 



PiZo 



degr 



cy 



S'\C\ 



where deg^(f) is the degree of vertex w in fi^"^ restricted to C (i.e. the number 
of elements of C with Hamming distance t to v). 

Suppose that our random walk starts from S £ C and reaches, after some 
time, a vertex T with x = p{S,T). Say that we next step from T to W. We 
seek an upper bound on the probability that p{S,W) < x + |. The total 
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number of neighbors W oi T in Q, for which this holds is 



A{x) 



E 

i>rt/3i 



x\ /d — X 
j)\t-j 



By Lemma 6.9, A{x) < 2 (1^)*/^ (^) when 2t < x < f . 

Now for every possible walk, p{Zr, Zq) < rt holds for every integer r > 1. 
For times 2 < r < 2^ we are able to show that the walk tends to drift at least 
a"t per step away from its origin for some absolute a" > 0. 

E[p{Zr+l , Zo)] 



>E 



deg'c{Zr)-A{p{Zr.,Zo)) ( ^^ t 

degciZr) V 3 



, A{p{Zr,Zo)) ^ 

H : — i .„ ^ {p{Zr, Zo) - t) 



deiciZr) 



A{p{Zr,Zo)) 

Aeg'ciZr) 
t 8t /150rt\*/^ fd 



t 8t /150rt\*/^ /d 



[p{Zr,Zo)] + --- 



[—) 




vac 



,, t 8t /IbOrtV^^ fd 



d / 



As in the proof of Theorem 6.6, we now contrast this estimate with the fact 
that Hilbert space has Markov type 2. Namely, that for every r, 

a^rt" > C2{Bfrt' > C2{CfrE[p\Zi, Zq)] > E[p\Zr,Zo)] > [Ep{Zr,Zo)f. 



Consequently, 



r- ^, ,^ ^ ,, rt Writ /150t^*/^ 
at^>E[piZr,Zo)]>j 



3k 



d 



Ej 

r>j>l 



t/3 



rt 16n /150ty/^ i+i 
We set r = [36a^] and, as stated above, choose t as the even integer nearest 



to 



5500a- 



to conclude the proof of the desired result. 



D 
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Remark 6.11. It is known that for every 1 < p < 2 the metric space 
{£p, \\x — y\\p ) embeds isometrically into £2 (see [52]). It follows that £p has 
Markov type p with constant 1. We can therefore apply the above arguments 
and conclude that Theorem 6.6 and Theorem 6.10 remain true when dealing 
with embeddings into £p, 1 < p < 2. The only necessary modification is that 
in the upper bound on the Ramsey function a^ should be replaced by a^'^^^^' . 

Acknowledgments. The authors would like to express their gratitude to 
Guy Kindler, Robi Krauthgamer, Avner Magen and Yuri Rabinovich for some 
helpful discussions. 

Institute of Computer Science, Hebrew University, Jerusalem, Israel 
E-mail address: yair@cs.huji.ac.il 

Institute of Computer Science, Hebrew University, Jerusalem, Israel 
E-mail address: nati@cs.huji.ac.il 

Institute of Computer Science, Hebrew University, Jerusalem, Israel 
Current address: Computer Science Division, The Open University of Israel, 
Ra'anana, Israel 
E-mail address: mendelma@gmail.com 

Theory Group, Microsoft Research, Redmond, WA 
E-mail address: anaor@microsoft.com 

References 

[1] N. Alon, Eigenvalues and expanders, Combinatorica 6 (1986), 83-96. 

[2] K. Ball, Markov chains, Riesz transforms and Lipschitz maps, Geom. Fund. Anal. 2 

(1992), 137-172. 
[3] Y. Bartal, Probabilistic approximation of metric spaces and its algorithmic applications, 

in The 37th Annual Symposium on Foundations of Computer Science, pp. 184-193, 

1996, IEEE Comput. Sci. Press, Los Alamitos, CA, 1996. 

[4] , On approximating arbitrary metrics by tree metrics, in The 30th Annual ACM 

Symposium on Theory of Computing (STOC '98, Dallas, TX), 161-168, ACM, New 
York, 1999. 

[5] Y. Bartal, B. Bollobas, and M. Mendel, A Ramsey-type theorem for metric spaces and 
its applications for metrical task systems and related problems, in The 42nd Annual 
Symposium on Foundations of Computer Science, pp. 396-405 (Las Vegas, NV, 2001), 
IEEE Computer Soc, Los Alamitos, CA, 2001. 

[6] Y. Bartal, N. Linial, M. Mendel, and A. Naor, Limitations to Ffochct's metric embed- 
ding method, Israel J. Math., to appear. 

[7] , Low dimensional embeddings of ultrametrics, European J. of Combinatorics 25 

(2004), 87-92. 

[8] , On some low distortion metric Ramsey problems, Discrete and Computational 

Ceorn. 33 (2005), 27-45. 

[9] , On metric Ramsey- type phenomena (Conference version) 35th Annual ACM 

Symposium on Theory of Computing, pp. 463-472, ACM, New York, 2003. 

[10] Y. Bartal and M. Mendel, Multi-embedding and path-approximation of metric spaces, 
in Proc. First Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 424-433 
(Baltimore, MD, 2003), ACM, New York, 2003. 



708 YAIR BARTAL, NATHAN LINIAL, MANOR MENDEL, AND ASSAF NAOR 

[11] M. Ben-Or and N. Linial, Collective Coin Flipping (S. Micali, ed.), Academic Press, 
New York, pp. 91-115, 1989. 

[12] Y. Benyamini and J. Lindenstrauss, Geometric Nonlinear Functional Analysts. Vol. 1, 
Amer. Math. Soc. Colloq. Publ. 48, A. M. S., Providence, RI, 2000. 

[13] N. Biggs, Algebraic Craph Theory, Cambridge Tracts in Math. 67, Cambridge Univ. 
Press, Cambridge, 1974. 

[14] A. Blum, H. Karloff, Y. Rabani, and M. Saks, A decomposition theorem for task sys- 
tems and bounds for randomized server problems, SI AM J. Comput. 30 (2000), 1624- 
1661 (electronic). 

[15] J. BouRGAiN, On Lipschitz embedding of finite metric spaces in Hilbert space, Israel J. 
Math. 52 (1985), 46-52. 

[16] , The metrical interpretation of superrefiexivity in Banach spaces, Israel J. Math. 

56 (1986), 222-230. 

[17] J. Bourgain, T. Figiel, and V. Milman, On Hilbcrtian subsets of finite metric spaces, 
Israel J. Math. 55 (1986), 147-152. 

[18] J. Bourgain, V. Milman, and H. Wolfson, On types of metric spaces, Transl. Amer. 
Math. Soc. 294 (1986), 295-317. 

[19] B. Brinkman and M. Charikar, On the impossibility of dimension reduction in £i, in 
Proc. 44th Annual IEEE Conference on Foundations of Computer Science, pp. 514-523, 
ACM, New York, 2003. 

[20] F. R. K. Chung, Diameters and eigenvalues, J. Amer. Math. Soc. 2 (1989), 187-196. 

[21] P. Delsarte, An algebraic approach to the association schemes of coding theory. Philips 
Res. Rep. Suppl. 10 (1973), vi-|-97. 

[22] M. M. Deza and M. Laurent, Geometry of Cuts and Metrics, Springer- Verlag, New 
York, 1997. 

[23] J. DiESTEL, H. Jarchow, and A. Tonge, Absolutely Summing Operators, Cambridge Univ. 
Press, Cambridge, 1995. 

[24] A. DvoRETZKY, Some results on convex bodies and Banach spaces, in Proc. Internat. 
Sympos. Linear Spaces (Jerusalem, 1960), pp. 123-160, Jerusalem Academic Press, 
Jerusalem, 1961. 

[25] P. Enflo, On the nonexistence of uniform homeomorphisms between Lp-spaces, Arkiv 
Mat. 8 (1969), 103-105. 

[26] U. Feige, Approximating the bandwidth via volume respecting embeddings, J. Comput. 
System Set. 60 (2000), 510-539, 30th Annual ACM Symposium on Theory of Computing 
(Dallas, TX, 1998). 

[27] T. FiGiEL, J. Lindenstrauss, and V. D. Milman, The dimension of almost spherical sec- 
tions of convex bodies, Acta Math. 139 (1977), 53-94. 

[28] J. Fakcharoenphol, S. Rao, and K. Talwar, A tight bound on approximating arbitrary 
metrics by tree metrics, in Proc. 35th Annual ACM Symposium on Theory of Computing 
(2003), 448-455. 

[29] R. L. Graham, B. L. Rothschild, and J. H. Spencer, Ramsey Theory, second edition, 
John Wiley & Sons Inc., New York, 1990. 

[30] P. Indyk, Algorithmic applications of low-distortion geometric embeddings, in 42nd 
Annual Symposium on Foundations of Computer Science, pp. 10-33 (Las Vegas, NV, 
2001), IEEE Computer Soc, Los Alamitos, CA, 2001. 



ON METRIC RAMSEY- TYPE PHENOMENA 709 

[31] W. B. Johnson and J. Lindenstrauss, Extensions of Lipschitz mappings into a 
Hilbert space, in Conference on Modern Analysts and Probability (1982), pp. 189-206, 
A. M. S., Providence, RI, 1984. 

[32] H. Karloff, Y. Rabani, and Y. Ravid, Lower bounds for randomized fc-server and 
motion-planning algorithms, SI AM J. Comput. 23 (1994), 293-312. 

[33] J. R. Lee and A. Naor, Embedding the diamond graph in Lp and dimension reduction 
in Li, Geometric Funct. Anal. 14 (2004), 745-747. 

[34] A. J. Lemin, Isometric embedding of ultrametric (non-Archimedean) spaces in Hilbert 
space and Lebesgue space, in p-adic Functional Analysts (Loannina, 2000), Lecture 
Notes in Pure and Appl. Math. 222 (2001), 203-218, Dekker, New York. 

[35] Vladimir L Levenshtein, Krawtchouk polynomials and universal bounds for codes and 
designs in Hamming spaces, IEEE Trans. Inform. Theory 41 (1995), 1303-1321. 

[36] N. LiNiAL, Finite metric spaces — combinatorics, geometry and algorithms, Proc. of the 
/CM (Beijing, 2002), 573-586, Higher Ed. Press, Beijing, 2002. 

[37] N. LiNiAL, E. London, and Y. Rabinovich, The geometry of graphs and some of its 
algorithmic applications, Combinatorica 15 (1995), 215-245. 

[38] N. Linial, a. Magen, and A. Naor, Girth and euclidean distortion, Geom. Funct. Anal. 
12 (2002), 380-394. 

[39] L. H. LooMis and H. Whitney, An inequality related to the isoperimetric inequality. 
Bull. Amer. Math. Soc. 55 (1949), 961-962. 

[40] F. J. MacWilliams and N. J. A. Sloane, The Theory of Error- Gorrecting Godes. I, 
North-Holland Mathematical Library 16, North-Holland Publ. Co., Amsterdam, 1977. 

[41] J. Matousek, On embedding expanders into lp spaces, Israel J. Math. 102 (1997), 189- 
197. 

[42] , Lectures on Discrete Geometry, Springer- Verlag, New York, 2002. 

[43] B. Maurey and G. Pisier, Series de variables aleatoires vectorielles independantes et 
proprietes geometriques des espaces de Banach, Studia Math. 58 (1976), 45-90. 

[44] V. D. Milman, a new proof of A. Dvoretzky's theorem on cross-sections of convex bodies. 
Funk. Anal, i Prtlozen 5 (1971), 28-37. 

[45] V. Milman and G. Schechtman, An "isomorphic" version of Dvoretzky's theorem. II, in 
Gonvex Geometric Analysis (Berkeley, CA, 1996), pp. 159-164, Cambridge Univ. Press, 
Cambridge, 1999. 

[46] , Asymptotic Theory of Finite- Dimensional Normed Spaces (with an appendix 

by M. Gromov), Springer- Verlag, New York, 1986. 

[47] , An "isomorphic" version of Dvoretzky's theorem, C. R. Acad. Sci. Paris Sir. 

I Math. 321 (1995), 541-544. 

[48] G. PisiER, The Volume of Gonvex Bodies and Banach Space Geometry, Cambridge Univ. 
Press, Cambridge, 1989. 

[49] F. P. Ramsey, On a problem of formal logic, Proc. London Math. Soc. 48 (1930), 122- 
160. 

[50] S. Rao, Small distortion and volume-preserving embeddings for planar and Euclidean 
metrics, in Proc. of the Fifteenth Annual Symposium on Computational Geometry (Mi- 
ami Beach, FL, 1999), pp. 300-306, ACM, New York, 1999 (electronic). 

[51] N. Robertson and P. D. Seymour, Graph minors. VIII. A Kuratowski theorem for gen- 
eral surfaces, J. Gombin. Theory Ser. B 48 (1990), 255-288. 



710 YAIR BARTAL, NATHAN LINIAL, MANOR MENDEL, AND ASSAF NAOR 

[52] J. H. Wells and L. R. Williams, Embeddings and Extensions in Analysis, Ergebntsse 
der Mathematik und ihrer Grenzgebiete, Band 84, Springer- Verlag, New York, 1975. 



Yair Bartal, Institute of Computer Science, Hebrew University, Jerusalem 

91904, Israel. 

yair@cs.huji.ac.il 

Nathan Linial, Institute of Computer Science, Hebrew University, Jerusalem 

91904, Israel. 

nati@cs.huji.ac.il 

Manor Mendel, Institute of Computer Science, Hebrew University, Jerusalem 

91904, Israel. 

mendelma@cs.huji.ac.il 

Assaf Naor, Theory Group, Microsoft Research, One Microsoft Way 113/2131, 

Redmond WA 98052-6399, USA. 

anaor@microsoft.com 



2000 AMS Mathematics Subject Classification: 52C45, 05C55, 54E40, 05C12, 
54E40. 

(Received December 4, 2002) 



